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Look after the pennies 


Government decisions about where to spend and where to cut should be based on 


evidence, not ideology. 


hen a nation’s expenses grossly exceed its income, as they 
W cai do in the United States, the most foolish way to 

curb the resulting deficit is to slash spending wildly. Yet 
that is the path that the US government has chosen to follow with 
this year’s ‘sequester. A smarter way is to follow the path pioneered 
by evidence-based medicine: fund what works and cut what doesn't. 
That is the approach being pursued in the budget submitted last week 
by US President Barack Obama (see page 277). 

In a budget chapter rarely mentioned in the media hoopla over pro- 
posed tax rises and spending cuts, the administration has laid out a 
blueprint to implement evidence-based decision-making throughout 
the government — in effect, bringing the methods of science to bear 
on policy. 

This reform effort began under Obama's predecessor, George 
W. Bush, but it has accelerated greatly with the need to do much more 
with significantly less. The White House Office of Management and 
Budget (OMB) signalled the way that things were going in May last 
year when it instructed government agencies to incorporate evidence- 
based strategies throughout their operations. 

The White House mandate applies to every agency in the executive 
branch, including those that fund science. But the most urgent target is 
the government's vast array of social services, which range from early- 
childhood enrichment projects to the home care of elderly patients. All 
were created with good intentions; few have had their effectiveness eval- 
uated with any kind of rigour, or by anything that resembles peer review. 

The OMB memorandum suggests several ways to change that situ- 
ation. One is to fund social services through a ‘tiered’ approach not 
unlike the stages followed in clinical trials. At the lowest tier, agencies 
would allocate seed money to promising, but unproven, ideas, pro- 
vided that the research programme builds in a rigorous assessment 
of outcomes by independent investigators — usually academic social 
scientists or non-profit research firms. 

In the higher tiers, more funding would be available for programmes 
that are supported by stronger evidence (and with built-in assessment 
protocols). The highest tier would be reserved for large-scale, multi- 
million-dollar programmes that are already supported by multiple 
gold-standard, controlled trials. 

Federal agencies are already using the tiered model for six evidence- 
based programmes — ranging from teen-pregnancy prevention to 
education — with budgets totalling about US$1 billion in 2012. 
Obama's new budget proposes to boost that funding by 44% for 2014. 

Another suggested strategy, pioneered in the United Kingdom, is 
a model known as ‘social impact bonds’ or ‘pay for success. It sees 
philanthropic organizations and private companies fund preventa- 
tive services, with the government paying them back only if rigorous 
assessments show that the services save taxpayers money. US federal 
agencies have tried this approach on a small scale with job-training 
programmes and projects to reduce recidivism in newly released 


prisoners. In 2014, Obama proposes to spend up to $195 million to 

expand these initiatives into areas such as housing and education. 
Such strategies for data-driven decision-making have the potential 
to radically improve the US government's efficiency and effective- 
ness, and deserve vigorous support from Congress — with one caveat: 
both Congress and president must be equally vigorous about support- 
ing research into what success actually means and how to measure it. 
These are still open questions in most areas of policy. Most parents, for 
instance, probably think that there is more to 


“A smarter a good education than getting their child to 
way is to score well on standardized tests. It can be dif- 
follow the path ficult to quantify such intangible benefits, but 
pioneered by that is no excuse not to try. 


Everyone favours government effectiveness 
as a concept. But every existing programme 
is also someone's livelihood. When those 
judged ineffective — by whatever measure — are cut or consolidated, 
the protests and the lobbying are fierce. If officials can resist that pres- 
sure, evidence-based policy initiatives could help to bring about a much- 
needed shift in the inflamed fiscal debate, from ideology to pragmatism. 

The OMB memorandum captured that spirit in a refreshingly un- 
bureaucratic call to arms: “Where evidence is strong, we should act 
on it. Where evidence is suggestive, we should consider it. Where 
evidence is weak, we should build the knowledge to support better 
decisions in the future.” 

That is easy to say; it is harder to do. But to say it is a start. m 


evidence-based 
medicine.” 


Smoke and mirrors 


Italy’s parliament must listen to expert advice 
before deregulating stem-cell therapies. 


election of a new pope, a grimmer pall hangs over the Eternal 

City — a fog of confusion and misrepresentation about stem-cell 
therapy. Those who have lit the fire beneath the debate say that they 
are promoting the translation of stem-cell research into the clinic so 
that currently incurable diseases can be treated. Nothing could be 
further from the truth. 

The Second International Vatican Adult Stem Cell meeting, held 
on 11-13 April in Vatican City, was a shamelessly choreographed per- 
formance. Sick children were paraded for television, sharing the stage 
with stem-cell companies and scientists desperate to hawk a message 
that their therapies must be speeded to clinical use. 


Je: weeks after the white smoke from the Vatican signalled the 
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A kilometre away at the Italian senate, meanwhile, parliamentarians 
further eroded protection for vulnerable patients targeted by stem-cell 
companies. On 10 April, they amended an already controversial minis- 
terial decree (see Nature 495, 418-419; 2013) with a clause that would 
redefine stem-cell therapy as tissue transplantation, thereby releasing 
it from any regulatory oversight. If the second parliamentary chamber 
endorses this amendment, Italy will be out of step with the rules of the 
European Union and the US Food and Drug Administration, both 
of which define stem cells modified outside the body as medicines. 

Many scientists around the world were appalled by the events in 
Rome, and rightly so. It is wrong to exploit the desperation of the 
disabled and the terminally ill and to raise false hopes of quick fixes, 
as some at the Vatican meeting tried to do. It is also wrong to try to 
use such patients as experimental animals by bypassing regulatory 
agencies, as the Italian parliament seems to want to do. 

Reputable stem-cell companies insist that stringent regulatory control 
is necessary, and that patients should be exposed to experimental treat- 
ments only when safety and efficacy is assured. Failures in the clinic will 
hold back the field. But not all of the cell-therapy industry is so tolerant. 

With their ability to differentiate into various cell types, stem cells 
hold enormous potential to repair damaged tissues. Human embry- 
onic stem cells can turn into any cell type, but many groups, including 
the Catholic Church, find their derivation from embryos unethical. 

The current controversy concerns adult stem cells. These exist in sev- 
eral tissues, but can replace only those particular tissues. Big claims are 
being made for them, with many trials of therapies under way world- 
wide for conditions as diverse as Alzheimer’s and heart disease. Some 
stem-cell therapies are approved by regulatory agencies; others sneak 
under the radar by exploiting rules allowing compassionate therapy, for 
example, or by operating in countries such as China or Mexico — and 
perhaps now Italy — where regulation is less strict. 

The scientifically naive Vatican finds the concept of adult stem cells 


attractive simply because embryos are not involved — yet it ignores 
the ethical implications of false hope. 

The main organizer oflast week’s conference was the non-profit Stem 
for Life Foundation, launched by the stem-cell company NeoStem, both 
based in New York. The foundation says that it is in favour of strict regu- 
lation of stem-cell therapies. But its conference programme, which left 
no room for questions, included many speakers who clearly were not. 

It was framed as a fight for reason and fairness 


“A lot more against an uncaring and intransigent scientific 
research into community. 

the deep biology Adult stem cells have already had clinical 
of stem cells is success, such as in bone-marrow transplan- 
needed.” tation for leukaemia treatment, growing new 


skin layers to treat burns and regenerating 
corneas. More ambitious hopes need to be tempered, however. Many 
trials involve infusing patients with mesenchymal stem cells from bone 
marrow, which are relatively easy to extract and grow. These can make 
only bone, cartilage and fat cells, but trials targeting other tissues use 
the rationale that other, non-stem properties of mesenchymal stem cells 
apply. It remains to be seen how effective these properties will be outside 
the normal biological home of the stem cells. 

Given the burden of incurable disease, rapid bench-to-bedside transla- 
tion is unquestionably crucial. Buta lot more research into the deep biol- 
ogy of stem cells is needed. Some trials approved by regulatory agencies 
may yield useful results, but that is a long shot without strong research 
data. Atleast they are safer under regulatory eyes. Unregulated treatment 
— such as that issued on a compassionate basis by the Stamina Founda- 
tion in Brescia, Italy, which led to the current ministerial decree — is 
more worrying. The second parliamentary chamber needs to heed inde- 
pendent expert advice before voting to deregulate stem-cell therapies. 

Stem cells will help to develop treatments for currently incurable dis- 
eases. But we are not there yet, whatever the smoke signals may say. m 


Due credit 


Nature’s podcast charts 12 landmark 
discoveries in the history of science. 


the mundane. A humble paperclip was biophysicist Raymond 

Gosling’s choice. Late one night in May 1952, ina chemistry lab 
in London, the PhD student wrapped DNA around a paperclip to keep 
the molecule’s fibres stretched taut in front of an X-ray source so that 
he could analyse their structure. The result was the celebrated ‘photo- 
graph 51’ — the image that told James Watson that DNA strands curl 
around each other like a twisted ladder, and that the specific pairings 
in the rungs are key to the mechanism of inheritance. 

The rest of that story is legend. Based on their work at the University 
of Cambridge, UK, Watson and Francis Crick published their paper in 
the pages of this journal, including a beautiful diagram of the double 
helix that was hand-drawn by Odile Crick, Francis’ wife (J. D. Watson 
and F. H.C. Crick Nature 171, 737-738; 1953). 

Next week marks the 60th anniversary of the publication of the 
famous Watson and Crick paper — and that of two other papers on 
DNA that appeared in the same issue. Neither was so high profile, but 
each was essential to the structure's discovery. Both were written by 
scientists at King’s College London: one by Maurice Wilkins and his 
colleagues Alec Stokes and Herbert Wilson, and the other by Gosling 
and his PhD supervisor, Rosalind Franklin. 

Only Gosling (now 86) and Watson survive from that group of seven 
scientists. Watson has never been shy, and his compelling swagger 
helped to establish another colossus of biology, the Human Genome 


| } ven the greatest scientific discoveries come with an element of 
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Project. But the supporting cast matters too, even on the biggest stages. 
Gosling is a Nature author, even if he is largely forgotten when the 
story of DNA is told. To mark the anniversary of his paperclip-inspired 
contribution, Nature has interviewed him. You can hear the results 
at go.nature.com/lizfik, in the first of a series of monthly podcasts to 
highlight 12 key scientific discoveries from the pages of this journal. 
(Future episodes in the ‘Pastcast’ series will plunder the Nature archive 
to investigate the discovery of X-rays in 1896, the early days of quantum 
theory in the 1920s and the first report of the ozone hole in 1985.) 

In the interview, a humble Gosling fondly recalls that Franklin’s 
response to Crick and Watson’s model of the double helix was gracious 
and sanguine: “She didn’t use the word ‘scooped. What she actually 
said was, ‘We all stand on each other’s shoulders.” 

All three papers appeared with no peer review — unthinkable now. 
The head of the King’s College biophysics unit, John Randall, belonged 
to the same London gentleman’s club — the Athenaeum — as Lionel 
(Jack) Brimble, co-editor of Nature. Randall convinced Brimble to 
publish Wilkins’ paper alongside Watson and Crick’s; Franklin’s paper 
was added only after she petitioned for its inclusion. 

This cavalier approach to submissions extended to the awarding of 
credit. Watson and Crick’s paper features only a glancing concession 
to being “stimulated by a knowledge of the general nature of [Wilkins 
and Franklin's] unpublished experimental results and ideas” There is 
no mention of Gosling by name. Gosling left research soon after, with 
no bitterness; in his words, he “was no good at it”. 

Discoveries take ego, genius, conflict, inspiration and fierce ambi- 
tion. But they also need the hard graft of PhD students who beaver 
away late into the night and improvise with what 
they find in the stationery cupboard. They do 
not always receive the recognition that they 
deserve. Raymond Gosling is a good place to 
start to reverse that trend. m 
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WORLD VIEW .jernsicor sen 


access to the journal Bioinformatics, a top-ranked title in math- 

ematical and computational biology. Many more publications are 
likely to disappear from Greek libraries. The Ministry of Education has 
not paid the bills for its subscription bundles. The largest publishers — 
including Elsevier, Springer and Taylor & Francis — have threatened 
to suspend access. Others have done so already. 

The denial of scholarly papers, the lifeblood of research, to Greek 
scientists could mark the beginning of the end for creative science at 
universities and research institutes. We will no longer be able to keep 
up with international contributions. In areas such as biomedicine, it is 
crucial to have access to the latest information. Many Greek research- 
ers, unable to afford personal subscriptions to their favourite journals, 
are already considering reviving a practice that was common a decade 
or so ago — contacting friends and colleagues 
in foreign research centres and asking them to 
fax or e-mail articles. 

For many readers of Nature, the hardship 
faced by Greek scientists will come as no sur- 
prise. The country is reeling from six straight 
years of recession and unprecedented auster- 
ity measures. More than one-quarter of Greek 
people are out of work. 

Iam one of them. I am a biologist with a 
PhD in biological chemistry from the Aristo- 
tle University of Thessaloniki. In 2003, I went 
to Spain to work as a postdoc at the National 
Centre of Biotechnology in Madrid. In 2008, I 
returned to Greece as a research scientist with 
the National Hellenic Research Foundation in Athens, ona succession 
of short-term contracts. In March 2011, I was elected assistant professor 
of cell biology at the faculty of medicine of the University of Thessaly 
in Larissa. But I never began work there: I am one of about 800 faculty 
members who are still waiting to take up appointments around the 
country because the government refuses to approve the budget nec- 
essary for their salaries. They are distinguished scientists, many with 
years of postdoctoral experience, who have been selected through a 
long and demanding process and have been appointed by the heads of 
their respective universities. 

The departments that selected these 800 faculty members are strug- 
gling to teach their students. In 2011, for the first time in decades, 
the Ministry of Education placed no new university professors. The 
scientific and professional prospects of young scholars in Greece are 
evaporating; this will leave the country’s universities lifeless and impo- 
tent. Budgets for research institutes have been 


cience in Greece is going backwards. This month, researchers lost 


reduced by 30%. The 2013 education budget NATURE.COM 
will cut funds bya further 14% and condemns __ Discuss this article 
Greece to scientific and educational dormancy. _ onlineat: 

There are no signs that the Greek government _0.llafllre.com/psqtsw 


HUMAN POTENTIAL 
SHOULD BE 


NOURISHED 


BY URGENT 
GOVERNMENT 


ACTION. 


Austerity-led brain drain 
is killing Greek science 


Lack of funding and recruitment freezes are driving young researchers out 
of the country, warns Varvara Trachana. 


understands that long-term commitment to funding science and edu- 
cation must be part of the strategy to boost economic growth. In 2007, 
even before the most recent cuts, university and research funding in 
Greece stood at 0.6% of gross domestic product in 2007, already far 
below the European Union average of 1.9%. 

Wages of researchers and faculty members have been reduced by 
20%. If someone tells you that the Greek economy fell because of giant 
public-sector salaries, tell them that the average monthly wage ofa 
university lecturer here is now around €1,000 (US$1,300). Researchers 
and professors who have spent years building their careers are asking 
whether it was worth it. 

Scientists have adapted. They have learned to use the bare minimum 
of expensive reagents. They have become skilled at working with their 
overcoats on when deliveries of heating oil fail to materialize. 

But many young scientists are heading 
abroad. In 2010, about 120,000 Greek scholars 
lived and worked elsewhere, about one-tenth 
of the total. The number is now estimated at 
150,000. The young, skilled workforce, a key 
factor for economic development, is disap- 
pearing exactly when society needs it most. 
I, too, am considering whether to leave. The 
situation in Greece, combined with plans by 
European Union leaders to cut the research 
and development budget, paints a bleak pic- 
ture for future generations. 

Greek science is worth investing in and 
worth saving. In 2012, against all the odds, the 
proportion of the country’s research that con- 
tributed to the top 1% of most-cited articles was 13th in the world, 
above Canada, Italy and France. This human potential should be 
nourished by urgent government action. 

A good place to start is to resist calls from existing professors to raise 
their retirement age from 67 to 70. The change has been suggested 
as a short-term fix for academic departments stretched to breaking 
point by the hiring freeze and retirements. Such a move would elimi- 
nate any chance that the new professors will be placed, given that the 
national programme of fiscal stability calls for the public sector to 
be dramatically reduced by making only one appointment for every 
ten retirements. It would block the natural and necessary renewal of 
university personnel with new blood. It would further accentuate the 
rapid ageing of Greek universities and leave faculties as shells contain- 
ing only a handful of older professors and next to no lecturing staff 
and young researchers. We know from evolutionary biology that such 
micro-societies soon become extinct. = 


Varvara Trachana is an assistant professor of cell biology at the 
University of Thessaly in Larissa (appointment pending for two years). 
e-mail: vtrachana@gmail.com 


18 APRIL 2013 | VOL 496 | NATURE | 271 


© 2013 Macmillan Publishers Limited. All rights reserved 


RESEARCH HIGHLIGHTS 


| NEUROSCIENCE 
Brain signature 
for thermal pain 


Brain activity could one day 
help physicians to monitor 
pain, for which there is no 
reliable physiological test. 
Using a type of magnetic 
resonance imaging that 
shows when certain parts 
of the brain are activated, 
Tor Wager at the University 
of Colorado Boulder and 
his colleagues started out 
by scanning the brains 
of 20 volunteers as they 
experienced warm to 
painfully hot sensations 
on their arm. They used 
these data to find a pattern 
of neural activation and 
deactivation that consistently 
appeared when the volunteers 
were exposed to painful heat. 
Further tests showed that this 
signature could discriminate 
physical pain from other 
stimuli, such as social pain 
and recollections of pain. 
The signature was reduced 
by analgesics. Such patterns 
could one day lead to more- 
objective assessments of pain. 
N. Engl. J. Med. 368, 1388-1397 
(2013) 


Roots spur on 
helpful biofilms 


Components of plant cell 
walls can induce a bacterium 
that is used as a plant 
fertilizer to assemble itself 
into sticky mats known as 
biofilms. 

Biofilms are often 
associated with hard-to- 
treat infections in animals, 
but those formed by the soil 
bacterium Bacillus subtilis 
actually protect plants from 
pathogens. A team led by 
Roberto Kolter at Harvard 
Medical School in Boston, 
Massachusetts, reports how 


Selections from the 
scientific literature 


Desert plants reap no rewards 


Atmospheres that are enriched with carbon 
dioxide can boost plant productivity in some 
ecosystems, but drought may prevent faster 
or greater growth in desert plants, such as 
those of the Mojave Desert (pictured) in the 


southwestern United States. 


Researchers led by Beth Newingham then 
at the University of Nevada in Las Vegas 
measured productivity above and below 
ground for plants that were exposed to high 
levels of atmospheric carbon dioxide at 
experimental sites in the Mojave Desertfor a 


decade. 


signals from the roots of 
the model plant Arabidopsis 
thaliana prompt bacteria 

to join up. Pectin and other 
polysaccharides on the 
surface of plant cell walls 
activate bacterial genes 
known to induce biofilms. 
The polysaccharides also 
form the raw materials that 
bacteria use to synthesize 
the extracellular matrix that 
holds the biofilm together. 
The plant, therefore, provides 
both the environmental 

cue and the building blocks 
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They found that dominant shrub and grass 
species that were exposed to higher levels of 
carbon dioxide showed some gains in weight 
and photosynthesis in wet years compared 
with plants at control sites that were exposed 


to ambient carbon dioxide, but that these 


suggest. 


to promote beneficial 
colonization of its roots. 
Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1218984110 (2013) 


CONSERVATION BIOLOGY 


Old evidence for 
fewer fish 


The price of fish in the 1800s 
has helped to reveal the long- 
term effects of bottom trawling, 
a fishing practice in which nets 
are dragged across the seabed. 
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gains were not sustained during drought. 
Desert ecosystems, which cover around 

one-third of Earth’s land surface, may be more 

limited by water than by carbon, the authors 


Glob. Change Biol. http://dx.doi.org/10.1111/ 
gcb.12177 (2013) 


Ruth Thurstan, now at the 
University of Queensland 
in Brisbane, Australia, and 
her colleagues examined the 
testimonies of hundreds of 
fishermen in the northeast of 
England about changes to fish 
stocks and practices during the 
nineteenth century. 

Statements about catch 
rate, price and fish size, which 
were given during two Royal 
Commissions of Enquiry (in 
1863-66 and in 1883-85), 
revealed a perception by 
fishermen that numbers of 


FRASER HALL/ROBERT HARDING PICTURE LIBRARY 


CHRISTOPHER FRAZEE 
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white fish, such as cod, had 
fallen by 64% during their 
careers. Many fishermen 
blamed the declines on 
trawling. These largely 
forgotten records suggest that 
trawling began to affect fish 
stocks long before scientific 
monitoring and official 
statistics began. 

Fish Fish. http://dx.doi. 
org/10.1111/faf.12034 (2013) 
For a longer story on this research, 
see go.nature.com/5rbcvq 


Leafy trap for 
bedbugs’ legs 


A natural Eastern European 
folk remedy catches bedbugs 
more effectively than synthetic 
mimics. 

Catherine Loudon at the 
University of California, 
Irvine, and her colleagues used 
scanning electron microscopy 
and videography to document 
why bean leaves, which 
are traditionally scattered 
around an infected bed to trap 
bedbugs (Cimex lectularius) 
are effective. Tiny hairs on the 
leaves, known as trichomes, 
trap the insects by impaling 
their feet and hooking their legs 
(pictured). The researchers 
designed a synthetic 
mimic of the hairs with a 
polyvinylsiloxane moulding 
material often used in dentistry, 
and ahybrid mimic made of 
synthetic stalks and natural 
trichome tips. Although both 
mimics snagged the bugs, they 
could not impale them. This 
suggests that a better bug trap 
may need to recapitulate the 
mechanical properties of bean 
leaf trichomes more closely. 

J. R. Soc. Interface 10,20130174 


(2013) 


CELL BIOLOGY 


Super-enhancers 
rule genes 


The myriad biomolecules that 
regulate gene expression are 
governed by an unanticipated 
layer of control. 

Researchers led by Richard 
Young at the Massachusetts 
Institute of Technology 
in Cambridge have found 
assemblies of enhancers 
— segments of DNA that 
associate with regulatory 
proteins and attach to genes 
to switch them on — that they 
have named super-enhancers. 
Compared with ordinary 
enhancers, super-enhancers 
bind to more proteins that 
activate gene transcription 
and are more sensitive to 
perturbation. The teams 
found that some cancer 
cells assemble pathological 
super-enhancers. 

Human cells contain 

tens of thousands of 
genes and many more 
enhancers, but most are 
controlled by only a few 
hundred super-enhancers 
that bestow characteristic 
properties on each cell type, the 
authors suggest. 

Cell 153, 307-319; 320-334 
(2013) 

For a longer story on this research, 
see go.nature.com/spue5g 


Symbionts set 
squid’s clock 


The tick-tock of an animal’s 
daily clock can be set, in part, 
by beneficial bacteria. 

At night, the bacterium 
Vibrio fischeri glows in 
acompartment in the 
underbelly of its symbiotic 
host, the Hawaiian bobtail 
squid (Euprymna scolopes; 
pictured), hiding the squid’s 
shadow in the moonlight. 
Margaret McFall-Ngai at the 
University of Wisconsin— 
Madison and her colleagues 
have found that the light- 
producing bacteria also cause 
the squid to boost expression 
of its light-responsive gene 
escry1 in the colonized 
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Cheap, colourful solar cells 
A light-harvesting crystal can make 


ea : hes efficient, colourful and low-cost devices 
2\Mar-18 April that convert sunlight into electricity. 


Researchers led by Sang II Seok at the 
Korea Research Institute of Chemical Technology in Daejeon, 
South Korea, created metal-organic hybrid materials that 
were composed of lead; methylammonium; and iodine 
or bromine, or both. Semiconductors used in solar cells 
can be coated in these materials, which absorb light and 
displace electrons that the semiconductor transfers to an 
external circuit. By varying ratios of bromine and iodine, the 
researchers tuned the material’s stability and light-absorption 
properties to produce solar cells with a variety of translucent 
colours, from dark brown to yellow. On average, the cells 
could convert 10% of sunlight energy into electricity, with 
amaximum of 12.3% — a performance that matches 


(2013) 


compartment. 
This boost, which 
helps to set daily 
rhythms, did not occur in 
bacteria-free squid, even if 
they were exposed to blue light 
to mimic bacterial glow. But 
when researchers provided 
squid with light as well as bits 
of the bacterium’s cell envelope, 
escry] cycling returned. 

The researchers suggest that 
microbiota could also set daily 
rhythms in mammals. 

mBio 4, €00167-13 (2013) 


IMMUNOLOGY 


Proteins that 
rouse also quash 


Signalling pathways that 
initially orchestrate potent 
antiviral attacks can, 
paradoxically, also help to 
sustain chronic infections. 
Proteins called type I 
interferons (IFN-I) curb 
viral replication during 
early stages of infection, but 


competing low-cost options. 
Nano Lett. http://dx.doi.org/10.1021/nl400349b 


are unable to clear chronic 
infections. To solve this long- 
standing puzzle, independent 
groups led by David 
Brooks at the University 
of California, Los Angeles, 
and Michael Oldstone at the 
Scripps Research Institute 
in La Jolla, California, 
studied mice infected with 
strains of lymphocytic 
choriomeningitis virus that 
cause chronic infection. 
Both groups found that 
IFN-I produces an initial 
burst of antiviral activity, but 
then depresses the immune 
system over the long term, 
allowing the virus to persist. 
Chronic activation of 
the immune system can 
damage tissue, and the 
authors speculate that IFN-I 
signalling dampens the 
immune-system response to 
limit this damage. Inhibiting 
IFN-I signalling may help 
to control chronic viral 
infections, they say. 
Science 340, 202-207; 207-211 
(2013) 
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SEVEN DAYS nescnsi 


US budget proposal 
President Barack Obama 
proposed nearly US$143 billion 
for research and development 
in his 10 April budget plan, an 
increase of about 1% above 
2012 levels. Applied research 

is slated to get $1.8 billion 
more than basic science in the 
battle for funds. See go.nature. 
com/8i6bon and page 277 

for more. 


Global health plan 
The United Nations Children’s 
Fund and the World Health 
Organization on 12 April 
announced a US$6.7-billion 
plan to treat and prevent 
childhood pneumonia and 
diarrhoea, which together kill 
2 million children under the 
age of 5 each year. The Global 
Action Plan for the Prevention 
and Control of Pneumonia and 
Diarrhoea aims to cut deaths 
from these conditions by 67% 
and 95%, respectively, by 

2025. It outlines 15 initiatives 
including wider vaccination, 
improved sanitation, provision 
of antibiotics and promotion of 
hand-washing. 


Polio campaign 
Leading scientists have 
supported a new global 
strategy to rid the world of 
polio by 2018. More than 
400 researchers and public- 
health experts signed a 


NUMBER CRUNCH 


The amount paid at auction 
for Francis Crick’s Nobel 
medal. The buyer was Jack 
Wang, chief executive of 
Chinese firm Biomobie, which 
sells electromagnetic aids for 
healing. See go.nature.com/ 
w43awo for more. 


Drought kills agricultural profit 


Land degradation and desertification are causing 
huge reductions in agricultural profits in the 
world’s poorest nations, according to a report 
presented at the United Nations Convention to 
Combat Desertification (UNCCD). The first 
assessment of its kind in more than 20 years 
calculates that Guatemala, for example, lost 24% 
ofits agricultural gross domestic product to 
environmental degradation. The findings were 
presented at the UNCCD’s second scientific 


letter published on 11 April, 
endorsing plans to halt the 
spread of the virus in Nigeria, 
Pakistan and Afghanistan 
(the three countries never 
to interrupt transmission), 
and to integrate polio 
immunization with the 
delivery of other childhood 
vaccines. See page 290 for 
more on polio eradication. 


The costs of storms 
Economic losses from 
powerful thunderstorms in the 
United States (those that cause 
more than US$250 million in 
damages) doubled between 
1970 and 2009, even when 
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normalized for socioeconomic 
changes and the growth of 
urban areas. Risk researchers 
with the insurance company 
Munich Re, based in Germany, 
presented the result at the 
meeting of the European 
Geosciences Union in Vienna 
last week. The increase 
mirrors the growing frequency 
of meteorological conditions 
that favour formation of 
thunderstorm cells, they say. 
See go.nature.com/qfltrq for 
more. 


Telescope go-ahead 
Hawaiian officials have 

given permission for the 
construction of the Thirty 
Meter Telescope (TMT) on 
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conference, on 9 April in Bonn, Germany. Nearly 
3.7 million people in East Africa still require food 
assistance after droughts in Kenya (pictured) 

and other nations wiped out crops in 2011, the 
UNCCD says. “Desertification, land degradation 
and drought are key constraints to building social 
and environmental resilience, achieving global 
food security and delivering meaningful poverty 
reduction,’ said UNCCD executive secretary Luc 
Gnacadja at the meeting. 


the 4,200-metre-high Mauna 
Kea mountain in Hawaii, 
project leaders announced 

on 13 April. Building could 
begin as early as April 2014. 
Thirteen telescopes already 
dot the sacred mountain, but 
the TMT would be the largest; 
the biggest optical instruments 
currently on Mauna Kea 

are the twin 10-metre Keck 
telescopes. See go.nature.com/ 
cy6r5s for more. 


Publishing deal 


The British start-up company 
Mendeley — a social academic 
network based in London, 
through which more than 
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2 million users share and 
comment on research papers 
and references — has been 
bought by Elsevier, the 
Amsterdam-based science- 
publishing giant announced 
on 9 April. Neither company 
declared how much Mendeley 
had sold for, but observers 
familiar with the deal said that 
Elsevier had paid £45 million 
(US$69 million). 


Lab kit billions 


A company that manufactures 
scientific instruments 

has announced one of the 
largest business deals of 

2013 so far. Thermo Fisher 
Scientific, headquartered in 
Waltham, Massachusetts, 

said on 15 April that it will 
buy biotech-equipment 
manufacturer Life 
Technologies of Carlsbad, 
California, for US$13.6 billion. 
The aim of the acquisition is 

to create a leading company in 
lab-kit production for research 
in proteomics, genomics and 
cell biology. See go.nature. 
com/27q9hf for more. 


PEOPLE 


IVF pioneer dies 
Robert Edwards, Nobel- 
prizewinning pioneer of 
human in vitro fertilization 
(IVF), died on 10 April in 
Cambridge, UK, aged 87. 
Edwards (pictured) and his 
colleague Patrick Steptoe, who 
died in 1988, began research 


TREND WATCH 


A vaccination programme to 


cut down on Salmonella bacteria 


in poultry is having success 
in Europe, where reported 
human cases fell for a seventh 


consecutive year, the European 


Food Safety Authority said on 
9 April. But human disease 
caused by Campylobacter, the 
most common cause of food 


poisoning, is increasing: because 
infection does not cause disease 


in poultry, it is difficult to 
create a vaccine that generates 
an effective immune response 
against the bacteria. 


into the technique in the late 
1960s despite public criticism 
and tight research budgets. 
The first child conceived 
through IVE, Louise Brown, 
was born in 1978. Edwards 
was the sole recipient of 

the 2010 Nobel Prize in 
Physiology or Medicine for 
his work; Steptoe was not 
included because the prizes are 
not awarded posthumously. 


Jewell confirmed 
The US Senate voted on 

10 April to confirm Sally 
Jewell, chief executive of the 
outdoor retailer Recreational 
Equipment, as the new 

head of the Department of 
the Interior, replacing Ken 
Salazar. As the department's 
chief, Jewell will oversee 
conservation, recreation, 
energy development and 
mining activities on more than 
200 million hectares of public 
land and almost 700 million 
hectares offshore. 


| FUNDING 
Space plans 


Russia will give its Roscosmos 
space programme a massive 
US$52-billion rise between 
now and 2020, President 
Vladimir Putin announced 
on 12 April. The first mission 
from its new Vostochny space 
port will be sent by 2015 with 
manned missions launching 
in 2018. ‘Super-heavy’ rockets 
capable of Moon missions are 
slated to be ready by 2020. See 
go.nature.com/grpejn for more. 


Bird flu spread 


The H7N9 avian flu virus has 
expanded its geographical 
range, with two human 

cases detected in Beijing, 
northern China, and two in 
the central Henan province. 
Previously, the virus had 

been limited to Shanghai and 
neighbouring regions on the 
eastern seaboard. As Nature 
went to press, 63 human cases 
including 14 deaths have 
been reported since China 
announced the first cases on 
31 March. There is not yet 
any evidence of sustained 
human-to-human spread. See 
go.nature.com/bklq5s for more. 


Romanian protest 


All 19 members of Romania’s 
National Research Council 
resigned on 12 April in 
protest at budget cuts and 


BATTLE AGAINST FOOD-BORNE ILLNESS 


Europe is cutting Salmonella infections — but making little progress 


in stopping Campylobacter. 
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SEVEN DAYS iISuaaa) 
COMING UP 


23 APRIL 

The US Institute of 
Medicine holds a 

public meeting in 
Washington DC to talk 
about priorities for 
research to reduce the 
threat of firearm-related 
violence. 
go.nature.com/o5zump 


22-25 APRIL 

In Darmstadt, Germany, 
experts meet ina 
quadrennial gathering 
to discuss how to detect 
and clear up space 
debris — useless chunks 
of satellites and rockets 
that clutter Earth’s orbit. 
go.nature.com/uoyjtj 


interference. The government 
had told the council how 

it should distribute basic 
research grants after slashing 
its budget on 8 April. Two days 
later, the education ministry 
launched a call for proposals 
in applied research, but under 
new rules removing a legal 
requirement for international 
peer review. The council says 
that such external review 

is necessary: in January, its 
respected peer-review system 
identified plagiarism in grant 
applications. 


To steal a ship 


A woman was detained 

by police in San Diego, 
California, on 9 April after 
apparently attempting to steal 
a research vessel belonging 

to the Scripps Institution of 
Oceanography. Members of 
the institute’s science team 
were aboard the 52-metre- 
long New Horizon at the time 
but slept through the incident, 
says a Scripps spokesman. The 
vessel is now en route to Point 
Conception in California, 
where it will replace a climate 
and ocean-acidification 
monitoring station. 


> NATURE.COM 
For daily news updates see: 
Www.nature,com/news 
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US President Barack Obama on 10 April unveiled a budget that proposes large increases for advanced manufacturing programmes. 


A back seat for basic science 


Translational research wins in Obama’s budget, but its economic value remains uncertain. 


BY ERIC HAND, BETH MOLE, LAUREN 
MORELLO, JEFF TOLLEFSON, MEREDITH 
WADMAN AND ALEXANDRA WITZE 


he magnolia trees behind the White 

| House were decked in blossoms as US 
President Barack Obama announced 

his 2014 budget proposal on 10 April. They 
emphasized the lateness of the proposal, 
which is usually unveiled in early Febru- 
ary. The short-lived flowers could also have 
symbolized a fragile optimism about the US 
economy. But with unemployment at a wor- 
risome 7.6% and Congress insistent on deficit 
reduction, Obama's budget is cautious. The 
US$3.8-trillion proposal would undo the 


across-the-board cuts known as sequestration, 
but includes just $143 billion for research and 
development, a mere 1% rise from 2012. 

The budget reflects not only caution, but 
also the Obama administration’s emerging 
attitude towards research: that science needs to 
be good for something, namely jobs and cures. 

“To make America a magnet for good jobs, 
this budget invests in new manufacturing 
hubs,” said Obama. “We'll spark new Ameri- 
can innovation and industry with cutting-edge 
research like the initiative I announced to map 
the human brain and cure disease.” 

The first reference was to a one-time, $1 -bil- 
lion request from the National Institute of 
Standards and Technology in Gaithersburg, 
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Maryland. The money would set up 15 manu- 
facturing hubs, modelled after a pilot project 
devoted to three-dimensional printing that 
was established in 2012 in the former steel- 
manufacturing town of Youngstown, Ohio. 
The second was to a multibillion-dollar 
brain-mapping project that would, in its 
first year, receive a total of $110 million from 
the National Institutes of Health (NIH) in 
Bethesda, Maryland; the National Science 
Foundation (NSF) in Arlington, Virginia; 
and the Defense Advanced Research Projects 
Agency in Arlington. The project's first steps 
— creating tools to record the activity of mil- 
lions of neurons — fall under basic research, 
but the White House has promoted the > 


18 APRIL 2013 | VOL 496 | NATURE | 277 


| NEWS IN FOCUS 


> project as key to understanding, treating 
and even curing brain diseases. 

Those initiatives embody a spirit that 
runs throughout the proposal, reserving big 
boosts for research areas that promise near- 
term economic and social benefits (see “Tight 
times’). Adam Jaffe, an economist at Brandeis 
University in Waltham, Massachusetts, 
who specializes in science policy, says that 
the administration’s emphasis is natural: it 
makes funding science more politically pal- 
atable in a difficult economic climate. “They 
want to make clearer arguments as to why 
this is helping people in the relatively short 
run, rather than helping pointy-headed sci- 
entists,” he says. 

But researchers unaccustomed to thinking 
about applications are starting to complain, 
says Jaffe. “They think it’s outrageous that the 
NIH should ask them to talk about the actual 
health-care implications of their research.” 

Shifting money towards applications can 
leave basic research in the cold because the pri- 
vate sector will not support it, notes Barry Toiv, 
a spokesman for the Association of American 
Universities in Washington DC. Commercial- 
izing research discoveries can be a good idea, 
he says, but only if “the fundamental federal 
responsibility of funding basic research isn’t 
undermined”. 


PHYSICAL SCIENCES 

Under the budget proposal, NSF funding would 
rise by 8.4% over 2012 levels (used asa baseline 
because Congress did not finalize 2013 fund- 
ing until late March). Much of that increase 
would go to purpose-driven initiatives such as 
the Cyber-Enabled Materials, Manufacturing 
and Smart Systems project, which would see its 
budget more than double, to $300 million. The 
project funds research into new materials and 
robotics that could aid manufacturing. 

The Innovation Corps (I-Corps), a train- 
ing programme established in 2011 to guide 
researchers in turning their discoveries into 
products, was a pet project of former NSF 
director Subra Suresh, who stepped down last 
month. The programme's budget would more 
than triple to $25 million under Obama’s 
proposal. “I think this fulfils a need for the 
country,’ says Dean Chang, associate vice- 
president of innovation and entrepreneurship 
at the University of Maryland in College Park, 
who was part of a team that won $3.75 mil- 
lion from I-Corps this year to start a regional 
training centre. A dash of entrepreneurialism 
could allow “academia to contribute to the 
economic development of the country’, adds 
James Chung, executive director of the Office 
of Entrepreneurship at the George Washington 
University in Washington DC. 

Engineering, with its applied focus, would 
get the largest boost of any research division at 
the NSF: 10.3%. That continues a trend: since 
the first Obama budget, for 2010, engineering 
funding has risen by 17%, to $911 million. Yet 
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TIGHT TIMES 


In US President Barack Obama’s budget request for 2014, applied research accounts for most of the 


increases in science funding (US$ billions). 


Agency 2012 2013* 
actual | estimated 

Biomedical research and public health 

National Institutes of Health | 30.70 | 30.90 

Centers for Disease Control | 5.66 5.69 

and Prevention 

Food and Drug 2.51 2.52 

Administration 

Physical sciences 

National Science fal 7.08 

Foundation 

NASA (science) s\(0)7/ By ll 

Department of Energy 4.93 4.90 

Office of Science 

National Institute of ONS) 0.76 

Standards and Technology 

Earth and environment 

Environmental Protection 8.45 8.50 

Agency 

National Oceanic and 491 5.26 

Atmospheric Administration 

US Geological Survey 1.07 1.08 


2014 Details 

requested 

31.17 Nearly flat, but translational-science 
centre gets a 16% boost from 2012 

9) 22 Includes $10 million for research on the 
causes and prevention of gun violence 

2.56 Overall $4.7-billion budget includes 
more than $2 billion in user fees 

7.63 An 8% rise, with a robotic- 
manufacturing programme seeing a 
large gain 

BO Mission to capture an asteroid debuts, 
and funding to monitor near-Earth 
objects would double 

515 Increases funds for applied and 
interdisciplinary energy research 

O13 Agency is also pursuing a one-off 
request for $1 billion to fund up to 
15 manufacturing-innovation institutes 

S15 A3.5% decline from 2012 includes 
hits to research, as agency prioritizes 
cutting greenhouse-gas emissions 

5,45 A plan to maintain continuity in 
weather-satellite data would shift 
climate sensors to NASA 

iL 7/ Boosts spending on ecosystem 
research and climate science 


*2013 figures do not include the roughly 5% across-the-board cut arising from the sequester. 


Source: White House Office of Management and Budget 


the maths and physical-sciences division has 
barely gained at all, increasing from $1.37 bil- 
lion in 2010 to $1.39 billion for 2014 (see 
‘Apply thyself”). 


BIOMEDICINE AND PUBLIC HEALTH 

The NIH, ever popular with the public and 
with patients’ groups, often gets more from 
Congress than the president requests. For now, 
the administration is calling for a boost of just 
1.5% over 2012 levels, to $31.2 billion. 

Projects that aim to speed treatments from 
labs to patients did much better. The fledgling 
National Center for Advancing Translational 
Sciences received a 16% boost to $666 million 
— by far the biggest jump, proportionately, 
among the agency’s 27 institutes and centres. 
A big chunk of that — $40 million — is an 
increase for the Cures Acceleration Network, 
which aims to make awards of up to $15 mil- 
lion for work on ‘high need’ cures that industry 
is unlikely to develop. 

The White House also returned to a prob- 
lem that it first flagged up for special atten- 
tion last year: Alzheimer’s disease, which cost 
the nation as much as $215 billion in 2010, 
according to a study published on 4 April 
(M. D. Hurd et al. N. Engl. J. Med. 368, 1326- 
1334; 2013). The budget would add $80 mil- 
lion to the roughly $500 million that the NIH 
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is already spending on the disease. 

“The NIH has always had this dual con- 
figuration as a science agency and as a health 
agency,’ says Irwin Feller, a science economist 
at Pennsylvania State University in University 
Park, adding that the second function has 
expanded in recent years. The proportion of 
NIH money spent on basic research has fallen 
from two-thirds in the 1980s and 1990s to just 
over half now, says Matt Hourihan, director of 
research and development budget and policy 
at the American Association for the Advance- 
ment of Science in Washington DC. 

Supporters of basic research “don’t particu- 
larly like” that trend, says Ben Corb, director 
of public affairs at the American Society for 
Biochemistry and Molecular Biology in Rock- 
ville, Maryland. He says that the NIH is feeling 
pressure from members of Congress, who “are 
asking: what are we getting for the $30 billion 
we're giving the NIH? Where are our cures? 
Where are our treatments?” 


ENERGY 

The Department of Energy (DOE) has always 
embodied the tension between basic and 
applied research. Its mission is to support 
applied work on energy extraction, even as its 
Office of Science funds blue-sky research that 
may not have applications for decades. 


SOURCE: NSF 
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The White House request would boost the 
DOE's budget to $28.4 billion, nearly 8% above 
2012 levels, with the largest increase — almost 
$1 billion — for initiatives targeting energy 
efficiency, renewables and advanced manufac- 
turing. Clean energy “has been the signature 
of this administration’, says Michael Lubell, 
director of public affairs for the American 
Physical Society in Washington DC. 

Investment would continue to flow to trans- 
lational science at the Energy Innovation Hubs 
and Energy Frontier Research Centers initiated 
by departing energy secretary Steven Chu. The 
administration is also asking for a 38% rise in 
funding for high-risk work at the Advanced 
Research Projects Agency-Energy (ARPA- 
E), which was designed to bridge a ‘valley 
of death’ between basic and applied energy 
research. Support for the hubs and ARPA-E has 
faced resistance from Republicans in Congress, 
wary of picking winners in the applied sector. 
But Obama’s administration hopes to launch 
prize schemes that, by providing incentives for 
innovation, may draw more bipartisan support. 
The budget includes $200 million in a one-time 
‘race to the top for states that provide the best 
proposals for investing in energy efficiency and 
advanced electricity-grid technologies, and a 
$25-million prize for the first natural-gas com- 
bined-cycle plant to be equipped with carbon 
capture and sequestration technologies. 

One of the largest increases in the Office of 
Science was reserved for basic energy sciences, 
a programme that, despite its name, supports 
applied ‘use-inspired’ materials research. Its 
budget would grow by 10% from 2012. Mean- 
while, high-energy physics would suffer a 
1.8% loss. With the shutdown of the Tevatron 
at Fermi National Accelerator Laboratory in 
Batavia, Illinois, in September 2011, the locus 
of activity in the field has shifted to the Large 
Hadron Collider at CERN, Europe's high- 
energy physics laboratory near Geneva, Swit- 
zerland. The budget cut will force officials at the 
DOE to make tough choices about remaining 
physics experiments, such as neutrino studies. 


SPACE AND EARTH SCIENCES 

NASA has been mission-oriented ever since 
it was tasked in 1961 with going to the Moon. 
Although some officials still advocate a return 
to the Moon (see Nature 492, 161-162; 2012), 
this year, the president’s budget proposal 
arrived on the same day as a congressional 


APPLY THYSELF 


Emphasizing applications, engineering has seen 
large gains at the US National Science Foundation. 
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hearing on the dangers of near-Earth asteroids, 
which seem to be the agency’s current focus. 
NASA wants to double the $20 million spent 
annually on hunting dangerous space rocks, 
mainly to track smaller ones and to identify a 
candidate for a surprising mission. If Congress 
approves the administration’s proposal, NASA 
would perform the first-ever asteroid-capture 
mission, in which a robotic spacecraft would 
snatch an asteroid 7-10 metres across, and tow 
it back to the vicinity of the Moon. Astronauts 
aboard the Orion crew vehicle could then visit 
and study the rockas early as 2021. The agency 
is asking for $38 million to develop solar elec- 
tric propulsion for the robotic part of the mis- 
sion, and $40 million to study how astronauts 
would work with the ‘uncooperative target: 
The Obama administration has emphasized 
NASAs Earth science, which would receive a 
5% increase from 2012 levels. Other Earth- 
observing programmes, run by the National 
Oceanic and Atmospheric Administration 
(NOAA), also benefit in the request. Funding 
for NOAAS satellite division, which operates 
geostationary weather probes and the Jason-3 
altimetry mission to monitor sea-level rise, 
would increase by 17%, to $2.2 billion, out- 
pacing the 11% increase for the agency overall. 
Astrophysics and planetary science at NASA 
would suffer cuts of 1% and 19%, respectively, 
from 2012 levels. Yet those two divisions would 
still get big prizes: the James Webb Space Tele- 
scope, an infrared successor to the Hubble 
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Space Telescope that is on track for a 2018 
launch, and a Mars rover to launch in 2020, 
built of spare parts from the Curiosity rover. 


REALITY CHECK 

As with all presidential budget requests, 
Obamas priorities are contingent on Congress. 
Mark Muro, a senior fellow at the Brookings 
Institution, a think tank in Washington DC, 
says that many of Obama's budget initiatives 
may amount to little more than “performance 
art” given the political focus on the deficit. 

Ifthe emphasis on commercially tinged basic 
research is to continue, science and technol- 
ogy economists agree that its value needs to be 
measured carefully. “It’s a little hard to know 
how well it works, because we don’t do any seri- 
ous evaluation of these programmes,’ says Jaffe. 

That means measuring not just monetary 
returns on investments, but also the social 
return, says Julia Lane, an economist at the 
American Institutes for Research in Washing- 
ton DC who until last year headed NSF efforts 
to measure research outputs. Negative results 
are part of the picture, she says. Private oil 
prospectors might not disclose locations that 
hold no oil, whereas the US Geological Survey 
would, which would benefit society. Similarly, 
unlike the NIH, drug companies may not 
report negative clinical-trial results. 

When mission-driven research is done at 
universities rather than in industry, it has an 
indirect social benefit: students who go on to 
found spin-off companies, says Lane. “The best 
way to translate knowledge is to wrap it up in 
ahuman being,’ she says. “We need to capture 
that piece of information” 

Translational research can have pay-offs for 
basic science, says Matthew Begley, a mechani- 
cal engineer at the University of California, 
Santa Barbara, who, along with his colleagues, 
received $1.2 million from the NSF to study 
high-temperature materials that could be used 
in aeroplane engines. He points out that the 
best scientists hop back and forth between 
basic and applied endeavours, and that basic 
research draws on applied science just as often 
as the other way around — witness the Large 
Hadron Collider, whose construction required 
major feats of engineering to allow it to dis- 
cover the Higgs boson. “The line between 
science and technology is sort of blurry,’ says 
Begley. “But when they're done well, the line is 
very blurry.’ m SEE EDITORIAL P.269 
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Synthetic biologists and 
conservationists open talks 


But worries persist about unintended consequences of tinkering with nature. 


BY EWEN CALLAWAY 


ustralian scientists made headlines 
A* month when they revealed that 

they were close to cloning a frog, 
Rheobatrachus silus, last seen in the wild three 
decades ago. If they succeed, it may take another 
emerging technology to keep that frog alive. 

Synthetic biology aims to endow organisms 
with new sets of genes and new abilities. Along 
with cloning, it has been portrayed in the press 
as a hubristic push to do fantastical things: 
bring back woolly mammoths or resurrect the 
passenger pigeons that darkened the skies of 
North America before they were eradicated by 
nineteenth-century settlers. 

But at a first-of-its-kind meeting, held on 
9-11 April at the University of Cambridge, 
UK, leading conservationists and synthetic 
biologists discussed how the technology could 
be applied in less fanciful ways to benefit the 
planet: to produce heat-tolerant coral reefs, 
pollution-sensing soil microbes and ruminant 
gut microbes that don’t belch methane. Also 
on the list were ways to help frogs to overcome 
chytridiomycosis, the fungal disease threaten- 
ing amphibians worldwide that is thought to 
have contributed to the extinction of R. silus. 

The discussions took place against a back- 
ground of mutual wariness, however. One 
synthetic biologist told the conference that 
he felt as though he were being treated like an 
irresponsible teenager who might accidentally 
wreck the planet. And conservationists bris- 
tled when a synthetic biologist announced that 
those in favour of the technology would win 
simply because they are younger than those 
who are against it. (He apologized the next 
day.) Kent Redford, a consultant for the Wild- 
life Conservation Society in New York and 
organizer of the meeting, made repeated calls 
for comity, plying attendees with free wine. 

Such bickering is a sign that the fields are 
beginning to engage, says Drew Endy of Stan- 
ford University, California, who is widely seen 
as a co-founder of the synthetic-biology field. 
“How are we going to explore the notion that we 

might aspire to rework 


SD NATURE.COM our civilization such that 
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Sweet wormwood could be complemented by 
yeast as a source of the antimalarial artemisinin. 


isn't going to figure it out by itself? he says. 
Producing fuels, foods and medicines 
with microbes is the cutting edge of the field, 
stresses Richard Kitney, a synthetic biologist at 
Imperial College London. “We're not trying to 
produce woolly mammoths,’ he says. 


SOIL SAVER 
As an example of what the field can offer 
conservation, Kitney cites an undergraduate 
project he supervised that was presented at 
the 2011 International Genetically Engineered 
Machine competition, a kind of synthetic- 
biology science fair. Christopher Schoene, 
now at the University of Oxford, UK, and his 
team engineered Escherichia coli so that the 
bacterium would migrate into plant roots 
and produce the growth hormone auxin. In 
greenhouse tests, roots of cress plants that con- 
tained the engineered bacterium grew longer 
than those without, and the soil retained more 
water. Such a bacterium might help to combat 
desertification — the degradation of fertile 
land into desert when soil nutrients are lost. 
But synthetic biology worries some observ- 
ers, who fear what might happen if genes or 
organisms escape from their intended niches. 
Paul Falkowski, a geomicrobiologist at Rutgers 
University in New Brunswick, New Jersey, sees 
value in microbes that can turn carbon dioxide 
into fuel or make fertilizers from atmospheric 
nitrogen, but he worries that industrial-scale 
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production could have drastic consequences, 
such as the inadvertent production of green- 
house gases. “I am rather amazed at the naivety 
of synthetic biologists at the way the world 
works,” he says. 

Many attendees also expressed nervousness 
about the potential of synthetic biology to 
influence land-use patterns. Microbes that 
reduce greenhouse-gas levels might lessen 
the pressure on governments to maintain 
rainforests, they said. Technologies that make 
marginal lands more productive could turn 
undeveloped land into single-crop farms. 

Such shifts are already beginning to occur. A 
project begun by Jay Keasling, a synthetic biolo- 
gist at the Lawrence Berkeley National Labora- 
tory in California, coaxed yeast to produce the 
antimalarial drug artemesinin at industrial lev- 
els (see Nature 494, 160-161; 2013). Much of the 
drug currently comes from cultivation of sweet 
wormwood (Artemisia annua), but Keasling 
believes that synthetic sources will eventually 
force A. annua growers in China and elsewhere 
to cultivate other crops. “I don't make the deci- 
sion about what gets produced,” says Keasling, 
whose company, Amyris in Emeryville, Cali- 
fornia, aims to produce industrial products 
with engineered microbes. “The marketplace 
decides. What I do is provide more options.” 

Concerns could be mitigated by designing 
ways to limit the spread of synthetic microbes. 
Schoene’s team, for example, added a genetic 
safeguard to its E. coli that stops other microbes 
from acquiring the auxin-producing gene. “If 
[safeguards] are being developed with as much 
creativity as other technologies, that would reas- 
sure mea lot,” says Stephen Palumbi, a marine 
biologist at Stanford University’s Hopkins 
Marine Station in Pacific Grove, California. 

Bill Sutherland, a conservation biologist at 
the University of Cambridge, agrees that his 
colleagues need to take synthetic biology seri- 
ously. But he says that a small poll he took at 
the meeting shows that the gulf between the 
two disciplines is not so wide. Both agree that 
more-efficient use of natural resources could 
be an important boon from synthetic biology. 
Both worry about the potential for synthetic 
organisms to harm natural ecosystems. 

One issue that neither group was excited 
or very bothered about: restoring long-dead 
species. “It’s a lot of fun,’ Sutherland says, “but 
it’s not going to save the world.” m 
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QUANTUM CONTEXTUALITY 


Researchers have simplified the 117 contextual questions in the 
original 1967 Kochen-Specker theorem (left) to a more 
manageable 18 (right). Colours indicate duplicated questions. 
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Contextuality theorem couldimprove secure communication. 


BY EUGENIE SAMUEL REICH 


ike Schrédinger’s cat, it was only sup- 
Le to be a thought experiment to 

elucidate the strange mathematics of 
quantum mechanics. 

Now, the 46-year-old Kochen-Specker the- 
orem, which describes the quantum dance of 
observer and observed, has passed its toughest 
test yet in the real world. The test, published in 
February (V. D'Ambrosio et al. Phys. Rev. X 3, 
011012; 2013), is indicative of growing interest 
in the theorem, triggered by new capabilities for 
manipulating photons and cold atoms (see ‘A 
quantum revival’). 

“We can test things that until now were just 
mathematics,’ says Adan Cabello, a physicist 
at the University of Seville in Spain, and a co- 
author of the paper. “We've been waiting for the 
technology.’ Although his team has focused on 
the pure maths of the theorem, follow-up work 
may eventually find practical use in defending 
encrypted conversations against attack, and in 
improving random-number generators. 

The theorem, first published in 1967 by 
mathematicians Simon Kochen and Ernst 
Specker, shows that it is incorrect to assume that 
the results of a quantum mechanics experiment 
are determined before measurements are made 
(S. Kochen and E. P. Specker J. Math. Mech. 17, 
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59-87; 1967). That assumption is valid in clas- 
sical physics; for example, the heat content of 
a cup of tea is unaffected by the thermometer 
measuring it. But it breaks down in quantum 
mechanics, where measurements change their 
subjects in ways that depend on what else is 
being measured — as if a set of thermometers 
conspired to create the heat that they measure. 
This behaviour is called quantum contextu- 
ality. One example of the principle is ‘spooky 
action at a distance; in which the quantum 
states of two particles are entangled such that 
measurements on one particle instantaneously 
influence the other, even if it is far away. 


A QUANTUM REVIVAL 


Citations of the 1967 Kochen-Specker theorem 
have soared since physicists have been able to 
test it with specially prepared atoms and photons. 
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Testing contextuality for particles that are not 
entangled has been hard because the Kochen- 
Specker theorem is so complex: it considers 
the answers to a set of 117 measurements on a 
single quantum particle that can exist in three 
or more quantum states. Preparing photons or 
atoms in more than two quantum states has 
been possible only in the past few years. 

The latest demonstration is the most faithful 
validation of the theorem yet, because it tests a 
set of 18 measurements that are mathematically 
very similar to the 117 described by Kochen- 
Specker (see ‘Quantum contextuality’). Cabello 
and his colleagues manipulated the angular 
momentum and polarization of identical pho- 
tons to create different combinations of four 
quantum states, then watched how the photons 
produced interference patterns in a detector. 
The researchers confirmed that the outcomes 
of the measurements are coordinated, just as 
Kochen-Specker implies has to be the case for 
quantum mechanics to hold. 

Specker died in 2011. Kochen, an emeritus 
mathematician at Princeton University in New 
Jersey, is impressed with the demonstration. 
“We thought it was a Gedankenexperiment, a 
thought experiment,’ he says. 

Tests of the theorem might have practical 
uses. Contextuality holds promise for cryptog- 
raphy, says Cabello, because it implies that a spy 
eavesdropping on messages sent through quan- 
tum systems will influence measurements by 
the agents trying to communicate. Ifthe agents 
find disruptions to the contextuality of their 
messages, they can halt the communication. 

Similarly, the results of quantum contextual 
measurements can generate random numbers, 
which are also valuable in cryptography, says 
Luming Duan, a physicist at the University of 
Michigan in Ann Arbor who has done simpli- 
fied Kochen-Specker tests on cold atoms. Only 
pure quantum measurements are truly random, 
and Kochen-Specker tests can verify that no 
classical effect has interfered with the random- 
number generation, he says. “Contextuality pro- 
vides a way to do this without using entangled 
states’, which are delicate and easily destroyed. 

Some physicists have doubts about the tech- 
nological applications. Adrian Kent, a quan- 
tum theorist at the University of Cambridge, 
UK, says that Kochen-Specker measurements 
are difficult to perform, and that an unknown 
classical aspect of a measurement could upset 
the quantum measurement. That loophole 
could be used to hack encryption based on the 
Kochen-Specker theorem, he notes. 

Fabio Sciarrino, a physicist at the Sapienza 
University of Rome and a co-author of the 
latest paper, says that his group has quanti- 
fied the extent of this loophole in its test. He 
is confident that the test is precise enough to 
prevent classical objects from masquerading 
as quantum ones. “We found very high agree- 
ment between experiment and theory,’ he says. 
That, at least, keeps the strangeness of quan- 
tum mechanics ona sure footing. m 


SOURCE: KOCHEN AND SPECKER (1967)/D'AMBROSIO ET AL. (2013) 
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‘Living fossil? genome unlocked 


The genes of an ancient fish, the coelacanth, have much to reveal about our distant past. 


BY CHRIS WOOLSTON 


he South African fisherman who pulled 

| a prehistoric-looking blue creature 

out of his net in 1938 had unwittingly 

snagged one of the zoological finds of the 

century: a 1.5-metre-long coelacanth, a type 

of fish that had been thought to have become 
extinct 70 million years earlier. 

Since then, scientists have identified two spe- 
cies of coelacanth, one African and one Indone- 
sian. With their fleshy, lobed fins — complete 
with bones and joints — and round, paddle-like 
tails, they look strikingly similar to the coela- 
canths that lived during the Cretaceous period, 
when dinosaurs still roamed Earth. 

Now, an international team of scientists has 
sequenced and analysed the genome of the 
African coelacanth, Latimeria chalumnae; the 
findings are reported on page 311. 

Like lungfish, the other surviving lineage 
of lobe-finned fishes, coelacanths are actually 
more closely related to humans and other mam- 
mals than to ray-finned fishes such as tuna and 
trout. Ancient lobe fins were the first vertebrates 
to brave the land, and the coelacanth genome 
is expected to reveal much about the origins of 
tetrapods, the evolutionary line that gave rise 
to amphibians, reptiles, birds and mammals, 
says lead author Chris Amemiya, a biologist at 
the University of Washington in Seattle. “The 
coelacanth is a cornerstone for our attempt to 
understand tetrapod evolution,’ he says. 

Ending one long-standing argument, 
analysis of the coela- 
canth genome clearly 
shows that it is not 
the closest living fishy 
relative to tetrapods, 

Amemiya says: that 

honour belongs to 

the lungfish. How- 

ever, he adds, the lungfish genome is unlikely 
to be sequenced any time soon because it is 
much larger and more complicated than that 
of the coelacanth. 

Although coelacanths are often called 
“living fossils’, these fish haven't really been 
frozen in time, says co-author Kerstin Lind- 
blad-Toh, a comparative genomicist at Uppsala 
University in Sweden. Comparison of protein- 
coding genes in coelacanths with those of car- 
tilaginous fishes shows that the coelacanths 
have been steadily accruing DNA changes. 

But the rate of change has been remarkably 
slow. The latest analysis shows that the genes 
of modern coelacanths can themselves be 


The African coelacanth is a close relative of the first fishes to venture onto land. 


considered living fossils, says James Noonan, 
a geneticist at Yale University in New Haven, 
Connecticut. 

Scientists already had hints of the coela- 
canth’s sluggish evolution. In a 2012 study, 
researchers in Japan and Tanzania compared 
the DNA of the African and Indonesian coela- 
canths. Specifically, they looked at HOX genes, 
which help to guide embryonic development 
(K. Higasa et al. Gene 505, 324-332; 2012). 
Even though the two species separated, by 
one estimate, perhaps 6 million years ago, 
their genes are remarkably similar. For these 
particular genes, the difference between the 
two species of coelacanth was about 11 times 
smaller than that between the HOX genes of 
humans and chimps, two species that parted 
ways perhaps 6 million to 8 million years ago. 


SLOW TO CHANGE 

It is impossible to say for sure, but the slow rate 
of coelacanth evolution could be due to a lack 
of natural-selection pressure, Lindblad-Toh 
says. Modern coelacanths, like their ancestors, 
“live far down in the ocean, where life is pretty 
stable’, she says. “We can hypothesize that there 
has been very little reason to change.’ And it is 
possible that the slow genetic change explains 
why the fish show sucha striking resemblance 
to their fossilized ancestors. 
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The analysis showed that not all parts of the 
coelacanth genome are slow to evolve. The 
genome has a large number of transposable ele- 
ments — non-coding parts of the genome that 
have an important role in gene regulation — 
that have been moving around in the genome 
at a relatively rapid pace. Non-coding DNA 
could be a significant source of evolutionary 
change, Lindblad-Toh says. But Amemiya adds 
that, for now, the role of non-coding DNA in 
speciation is “speculative”, and its significance 
in coelacanth evolution is not clear. 

As expected, the genome holds clues to the 
genetic changes behind the transition from a 
lobed fin to a tetrapod limb, Amemiya says. For 
example, the analysis found that coelacanths 
and tetrapods share a regulatory gene sequence 
that helps to promote limb development. But 
other findings came as a complete surprise. The 
fish is the first vertebrate found to lack genes 
for immunoglobulin-M, an almost universal 
immune-system protein. Instead, it has two 
genes for a distantly related immune protein 
that evidently “pick up the slack’, he says. 

Further analysis of the genome is bound to 
reveal much about our own distant past, adds 
Noonan. “It will allow us to identify the genetic 
drivers of tetrapod evolution, the genes and 
regulatory elements that are responsible for 
the vertebrate land transition.” m 
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GLOBAL WARMING 


Climate models fail to 
‘predict’ US droughts 


Simulations identify past megadroughts, but at wrong times. 


BY QUIRIN SCHIERMEIER 


CC his would bea fine country if it only 
Tose water,’ observes a settler look- 
ing at the barren west Texas plains. 

“So would Hell,” replies a despairing farmer. 

That old Texas joke probably originated in 
the 1950s, when the state was baked by its most 
relentless drought in recorded history. Last 
year, rain kept clear of the region again, and 
scientists predict that the entire North American 
southwest will become increasingly drought- 
prone as climate change proceeds’. 

Reliable forecasts of future ‘megadroughts’ 
would be a boon to farmers and water manag- 
ers. But results presented last week at the annual 
assembly of the European Geosciences Union 
in Vienna suggest that such forecasts are still 
beyond the reach of current climate models. 

Sloan Coats of Columbia University’s 
Lamont-Doherty Earth Observatory in Pali- 
sades, New York, and his colleagues tested 
whether a state-of-the-art climate model could 
simulate the droughts known to have occurred 
in the southwest during the past millennium. 
The model incorporated realistic numbers for 
factors that affect temperature and rainfall, such 
as atmospheric carbon dioxide levels, changes in 
solar radiation and ash from volcanic eruptions. 

It also incorporated changes in the El Nifio/ 
Southern Oscillation (ENSO), a recurring tem- 
perature anomaly in the tropical Pacific that 
greatly affects weather in the western United 
States and many other parts of the world. 
(The warm phase — El Nifio — often brings 
torrential rain and flooding; the cold phase — 
La Nifia — tends to bring drought.) The team 
then compared the results of its simulations 
with data from the North American Drought 
Atlas, a detailed compilation of droughts based 
on the thickness of tree rings. 

The results were puzzling. Although the 
simulation produced a number of pronounced 
droughts lasting several decades each, these did 
not match the timing of known megadroughts. 
In fact, drought occurrences were no more in 
agreement when the model was fed realistic 
values for variables that influence rainfall than 
when it ran control simulations in which the 
values were unrealistically held constant. “The 
model seems to miss some of the dynamics 
that drive large droughts,” says study partici- 
pant Jason Smerdon, a researcher at Lamont- 
Doherty who studies historical climate patterns. 
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Other climate models tested by the team 
fared no better, he says. In particular, the mod- 
els failed to reproduce a series of multi-decadal 
droughts that occurred in the southwest dur- 
ing the Medieval Climate Anomaly, a period 
between ap 900 and 1200 when global tem- 
peratures were about as high as they are today. 

The problem may lie in the models’ inability 
to reproduce the cycling between the ENSO’s 
El Nifo and La Nifia phases, especially given 
that many scientists think that La Nifia is the 
major driver of drought in the southwest. The 
ENSO “behaves much messier in the real world 
than in climate models’, says Jessica Tierney, a 
climate scientist at the Woods Hole Oceano- 
graphic Institution in Massachusetts who has 
investigated the role of the ENSO in East Afri- 
can rainfall variability”. “We're not sure how it 


has varied in the past, 
“The model and we don’t know 
seems to miss how it might change 
some of the in response to climate 
dynamics that change. This is really 
drive large one of the big uncer- 
drou ghts. ad tainties we're facing” 


In addition to their 
failure to reproduce El Nifio and La Nita, exist- 
ing models do not fully capture other factors 
that influence rainfall, such as clouds and vege- 
tation. But Smerdon adds that the atmospheric 
and oceanic dynamics that inhibit rainfall and 
favour prolonged drought may be essentially 
random and so almost unpredictable. 

Last week's findings highlight the broader 
challenge of predicting how precipitation pat- 
terns will change as the global climate warms. 
Models are often at odds over the very direc- 
tion of regional changes. For example, different 
projections prepared for the Colorado Water 
Conservation Board disagree on whether 
mean precipitation in the state will increase or 
decrease by 2050 (ref. 3). 

But the uncertainties don't change the larger 
picture, scientists say. “Climate models are not 
perfect, but they do the big things really well? 
says Tierney. “We can be pretty confident that 
the southwest will warm and that water will 
become scarcer.” m 


1. Seager, R. et al. Science 316, 1181-1884 (2007). 

2. Tierney, J. E. Smerdon, J. E., Anchukaitis, K. J. & 
Seager, R. Nature 493, 389-392 (2013). 

3. Ray, A. J. et al. Climate Change in Colorado: A 
Synthesis to Support Water Resources Management 
and Adaptation (Colorado Water Conservation 
Board, 2008); available at go.nature.com/xs8471. 
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SPLINTERS 


OF THE 


AMAZON 


Decades after Thomas Lovejoy isolated fragments of the Brazilian rainforest ina 
grand experiment, researchers are building on his legacy around the world. 


BY 


JEFF TOLLEFSON 


cologist Thomas Lovejoy tucks his trousers into his 

socks with a casual warning about chiggers and then 

hikes off into the Amazon jungle. Shaded by a tall 

canopy and dense with ferns and underbrush, the old- 

growth forest looks healthy, but Lovejoy knows bet- 
ter. Three decades ago, the surrounding forest was mowed 
down and torched as part of a research project, and the 
effects have spread like a cancer deep into the uncut area. 
Large trees have perished. The spider monkeys have moved 
out, as have the army-ant colonies, and many of the birds 
that depend on them. 

Lovejoy and his team have been studying this 10-hectare 
fragment of forest since the late 1970s as part of the largest 
and longest-running experiment in tropical ecology. In col- 
laboration with ranchers, they cleared the trees around this 
and ten other plots of varying size to create islands of intact 
forest. The researchers have been monitoring the plots ever 
since, documenting how deforestation harms the adjacent 
untouched forest as specialist plants and animals gradu- 
ally give way to generalists and pioneer species that prefer 
disturbed habitat. “We are chronicling the simplification 
of these forests,” says Lovejoy, a professor at George Mason 
University in Fairfax, Virginia. 

Covering roughly 1,000 square kilometres in an area 
north of Manaus in the central Amazon, the experiment 
was set up to test fundamental theories about the viability of 
small, disconnected ecosystems. By documenting pervasive 
changes in the forest fragments, Lovejoy and his co-workers 
provided the first hard data that conservationists needed to 
promote the preservation of extensive areas of intact forest. 
“Tt’s the most important ecological experiment ever done,” 
says Stuart Pimm, a conservation ecologist at Duke Uni- 
versity in Durham, North Carolina, who has collaborated 
on the project. “We knew that small and isolated was bad, 
but we needed to know how bad.” 

The researchers are now exploring the long-term effects 
of habitat fragmentation, but the ecological record there 
is ironically threatened by forest that is taking over aban- 
doned pastures. Although Lovejoy has struggled to main- 
tain financing for long-term monitoring, the US National 
Science Foundation is breathing new life into the project by 
funding the team to isolate some of the plots anew. 

The experiment has also helped to train and inspire a 
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generation of ‘fragmentologists, who are working around 
the world to understand the cascade of ecological impacts 
that follow human development. Most notably, in early 
April, an international team started chopping down trees 
in Borneo as part of an nearly £6-million (US$9-million) 
experiment that replicates and extends the Brazilian one. 

“The Amazon experiment changed the game,” says 
Rob Ewers, principal investigator on the Borneo project 
at Imperial College London. “I like to think of our project 
as the next step.’ 


THE AMBASSADOR 


“Welcome to Camp 41,’ says Lovejoy, beaming at a group 
of guests he invited to tour the experiment — and do alittle 
bird-watching — over New Year's Eve, an annual tradition. 
Fit at 71, he has a slight paunch, a crop of thinning hair and 
pale skin that is a touch reddish from the heat and the hike 
to his forest base. Lovejoy offers a quick tour of the open- 
air shelters that house hammocks and dining facilities as 
well as the bathrooms, showers and a makeshift pool down 
a trail by the stream. Over the years, he has entertained a 
long list of high-profile guests here, ranging from Al Gore 
(when he was a senator) to actor Tom Cruise and high- 
ranking Brazilian officials. 

Lovejoy has always served as a scientific ambassador 
and chief fund-raiser, and left the fieldwork to others. After 
cleaning up from a day tramping around the forest, he sits 
beneath a cashew tree and begins plying his guests with 
caipirinhas, the national cocktail of Brazil. Peering over 
wire-framed glasses, he guides conversations about the 
strange beauty of tropical creatures, environmental policy 
and the history of science and development in the Amazon. 
Darkness falls, and an orchestra of frogs claims the forest. 

Lovejoy arrived in the Amazon to study birds as a grad- 
uate student from Yale University in New Haven, Con- 
necticut, in 1965, just as concerns about development in 
the region were rising among scientists and politicians. 

That same year, Brazil enacted its 


D> NATURE.COM modern Forest Code, which at the 
To see a slideshow time required ranchers and farmers 
about the Brazil in the Amazon to maintain a ‘legal 
experiment, visit: reserve on 50% of their land (the 
go.nature.com/e480pz legal reserve is now 80%). 
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Two years later, entomologist Edward O. Wilson and 
biologist Robert MacArthur published their influential The 
Theory of Island Biogeography, which laid the foundations 
for the modern understanding of species diversity and rates 
of extinction in isolated habitat, whether surrounded by 
water or by agricultural fields. 

Soon after Lovejoy earned his PhD in 1971, ecologists 
became embroiled in what became known as the SLOSS 
debate, which stood for ‘single large or several small. The 
question was whether it would be better to protect mas- 
sive continuous landscapes or many smaller biodiversity 
hotspots. Lovejoy thought about the Brazilian law and real- 
ized that the legal reserve could provide a way to probe 
these questions. “The ranchers were going to clear the land 
anyway, he says. “My crazy idea was that maybe you could 
arrange the 50% and create a giant experiment.” 

The project kicked off in 1978, with $500 a month from 
conservation group the WWE and the support of the Bra- 
zilian National Amazonian Research Institute (INPA) in 
Manaus. Lovejoy hired another former student from Yale, 
ecologist Rob Bierregaard, to run the project. A year later, 
he and a team of Brazilian scientists began surveying the 
forests areas they were planning to isolate, which came in 
sizes of 1, 10 and 100 hectares (see ‘Fractured forest’). The 
first wave of machetes and chainsaws came through in 
June 1980, and in September, Bierregaard’s team walked 
the perimeter of the plots, dripping burning rubber onto 
forest debris. When the conflagration died down, the first 
two square patches of old-growth rainforest were left stand- 
ing amid swathes of smoking embers that remained hot 
enough to cook the crew’s beans for days. 

The early phase of the experiment was hardly smooth. 
The Brazilians complained that it was too much of a US 


initiative, and the ranchers were slow to clear the rest of 
their land. One year passed, then two. “It was really frus- 
trating,’ says Bierregaard, who is now at the University of 
North Carolina at Charlotte. “We were publishing totally 
unreplicated results from the 1- and 10-hectare reserves.” 

By 1983, rather than waiting for the ranchers, Lovejoy 
secured more funding from the WWF to create another 
pair of fragments. The results began rolling in immediately, 
with the edges of the plots showing a substantial loss of 
key species. Yet, as the experiment grew, Lovejoy’s team 
soon had more data than it could deal with, and in 1996 he 
brought on Bill Laurance to help make sense of the plant 
data. In 1997, Laurence and the team reported that up to 
36% of the biomass in the first 100 metres of the forest 
fragments had disappeared in 10-17 years of isolation’. 
“Tt really taught people how edge effects are driving rapid 
changes in ecology,’ says Laurance, who is now stationed 
at James Cook University in Cairns, Australia. 

The main drivers are sunlight and air circulation. As 
the pastures and forest edges heat up each day, the air over 
those regions rises, drawing cool moist air out of the forest. 
The hot dry air takes a toll on large hardwood trees such as 
mahogany and ebony. The open fields also expose the forest 
to wind, which blows down trees and further opens up the 
canopy. Over time, these gaps are filled with fast-growing 
trees and vines. These pioneer species eventually seal off 
the forest like a scab, helping to delay further impacts, but 
neither the carbon density nor the diversity of the forest 
recovers quickly. Today, the researchers continue to track 
these effects as they work their way through the forest. 

Those early results suggested that scientists were 
underestimating the broader impact of fragmentation, 
and in 1998, Laurance extrapolated the findings across the 
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KNEW THAT 
SMALL AND 
ISOLATED 
WAS BAD, 
BUT WE 
NEEDED 10 
KNOW HOW 
BAD.” 


18 APRIL 2013 | VOL 496 | NATURE | 287 


© 2013 Macmillan Publishers Limited. All rights reserved 


| NEWS FEATURE 


KANSAS 
In 1984, researchers let 
different-sized patches 
of an agricultural field 
go fallow to investigate 
how plant and animal 
species would colonize 


the plots. BRAZIL 


Since 1979, ecologists 
have been monitoring 
squares of primary 
forest north of Manaus 
that were surrounded 
by deforested zones. 


A FAMILY 
OF FRAGMENTS 


A pioneering study in the 

Amazon spawned nearly 

two dozen experimental 

studies of isolated 

habitats in other parts 

of the world. ; . . : . 
tropics. His team’s calculations suggested that the biomass 


loss around forest edges could produce up to 150 million 
tonnes of carbon emissions annually” — exceeding emis- 
sions from the United Kingdom. 

In 2003, Lovejoy and his fellow fragmentologists took 
their first stab at one of the questions that had originally 
inspired the project: how big is big enough? Documenting 
a 50% decline in the number of bird species living beneath 
the canopy in the 100-hectare plots over the first 15 years 
of isolation, they derived a kind of half-life for extinction: 
the time it takes to lose half of the species increases roughly 
tenfold with a 1,000-fold rise in the area of a reserve’. 

The calculations were based only on birds and may not 
capture the ecological dynamics of larger reserves. But for 
Lovejoy, they suggest that a reserve would need to be on 
the order of 100,000 hectares in area to maintain relatively 
stable levels of biodiversity. Given that it is impractical in 
most places to set aside such large areas, this suggests that 
most fragmented landscapes around the globe may be 
doomed to a continuing decline in biodiversity. 

Yet, the fragment experiment also points to a possible 
solution: the secondary forests that have sprung up in 
abandoned pastures. The regrown forests created wildlife 
corridors, which allowed army-ant colonies, birds and 
howler monkeys and other small mammals to migrate 
from intact forest into the experimental plots. For Pimm, 
those findings served as a call to action. In 2005, he created 
anon-profit conservation organization called SavingSpe- 
cies, with a goal of identifying biodiversity hotspots such as 
Brazil's Atlantic rainforest that would benefit from habitat 
corridors — a kind of lifeline to the remaining forest. 

“This is where the science comes together with the 
strategic and tactical conservation efforts, Pimm says. 
In 2007, his group and its partners purchased a ranch 
that separated a small population of endangered golden 
lion tamarins from a larger forested area. After the team 
moved out the cattle and replanted some areas, secondary 
forest started to grow. Last year, the researchers saw the 
first evidence of golden lion tamarins and cougars moving 
through former ranchland. “Our bet on studying forest 
fragments paid off? he says. 

Looking forward, Lovejoy says that many crucial 
questions remain about the processes playing out in the 
fragmented ecosystems. How many of the species in the 
plots are doomed to extinction? How quickly and deeply 
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BORNEO 
This month, 
researchers started 
clearing around 
patches of trees 
within palm-oil 
plantations and 
secondary forests. 


will these impacts move through the fragmented forests? 
Will rapid shifts in insect and other animal populations 
drive long-term changes in seed dispersal and thus plant 
diversity? And what role will global warming have? 

Continued monitoring of the fragments could produce 
some answers, but the secondary growth around the plots 
has undermined the viability of the experiment. The US 
National Science Foundation provided money for a new 
bout of clearing later this year, which is part of a larger 
$450,000, five-year grant to continue bird research there. 
“This grant will extend our record to nearly 40 years,” says 
Phil Stouffer, an ornithologist at Louisiana State University 
in Baton Rouge and principal investigator on the grant. For 
Lovejoy, it is a new lease on life for the experiment. 

As the project continues, long-term data from the 
fragments and from the larger control plot in the adja- 
cent primary forest could be invaluable for studying the 
impacts of global warming, says Adalberto Val, who heads 
the INPA, which provides part of the half-million-dollar 
or so annual operating costs of the experiment. The rest 
comes from the Smithsonian Tropical Research Institute 
in Panama and various foundations. He calls the forest- 
fragments project a “scientific treasure”. 


GLOBAL SPREAD 


In 1984, a team led by Robert Holt, now at the University of 
Florida in Gainesville, implemented one of the first follow- 
on studies, focusing on experimentally designed patches of 
secondary regrowth amid the agricultural fields of Kansas 
(see ‘A family of fragments’). By 1990, another four projects 
were under way, anda survey published in 2003 identified 
a total of 21 fragmentation experiments of various scales 
and durations’. Andrew Gonzalez, an ecologist at McGill 
University in Montreal, Canada, has replicated Lovejoy’s 
experiment at the bench- and chamber-scale with isolated 
patches of moss; he has used the results to inform his own 
work on wildlife corridors in Montreal’. 

The experiments have produced a range of results, but 
most of them confirmed significant impacts along the 
edges of the fragments that broadened over time. And the 
subsequent studies have reinforced the findings that habi- 
tat corridors can help to sustain the isolated fragments. 

Lovejoy’s latest scientific progeny — third-generation 
fragmentologist Ewers — initially worked in the Amazon 
as a 25-year-old postdoc under Laurance. Ewers first visited 
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Camp 41 in 2004 while conducting unrelated research on 
deforestation. He is now setting up a similar experiment in 
the state of Sabah in Borneo. Whereas Lovejoy’s experiment 
focused on primary old-growth forest, Ewers wanted to 
observe the evolution of landscapes that have already been 
degraded and transformed by humans — something more 
representative of modern environmental realities. He set up 
fragments in logged forests and palm plantations as well as 
plots within forest reserves to investigate the ecological and 
conservation value of different landscapes. 

“The general idea of Lovejoy’s project was just 
breathtaking 30 years ago,” says Ewers. “But we have new 
questions today, and I think we are in a much better place 
to make sense of these data in the long run” 

His initiative, known as the Stability of Altered Forest 
Ecosystems (SAFE) project, is receiving some £6 million 
pounds in core funding over ten years from the Sime Darby 
Foundation in Kuala Lumpur, the philanthropic arm of 
one of the world’s largest palm-oil producers. Forty-two 
plots mirroring the sizes used in the Amazon project will 
be located at various distances from the surrounding for- 
ests. Dozens of scientists have been surveying the sites, and 
the project now has data on 3,000 to 4,000 tropical species, 
from ants and beetles to birds, bees and trees — a database 
much larger than the one Lovejoy and Bierregaard were 
able to compile at the outset of their experiment. After a 
year’s delay, loggers fired up the chainsaws on 4 April. 

The unpublished baseline data show a landscape 
severely affected by development. For instance, the aver- 
age amount of carbon locked up in trees in the primary 
forests has been estimated at 243 tonnes per hectare, com- 
pared with 49 tonnes per hectare in logged forest and just 
4 tonnes per hectare within palm plantations. And yet, 
while many logged forests have been hard-hit ecologically, 
they still show pockets of remarkable biodiversity, includ- 
ing all five of the native cat species. 

Ed Turner, an ecologist at the University of Cambridge, 
UK, who helped to set up the experiment, says that the 
project is designed to inform discussions about how to 
conserve habitat in a region dominated by palm-oil pro- 
duction. “The lessons from SAFE have the potential to 
have a very far-reaching effect,’ he says. 

Another component of the Borneo project will focus on 
how nitrogen and other nutrients move through soils and 
plants, with the goal of understanding the biogeochemi- 
cal processes at work throughout the forests as they are 
affected by fragmentation. The UK National Environmen- 
tal Research Council is expected to announce in the coming 
months nearly £5.7 million for a series of projects focused 
on the relationship between biodiversity and biogeochemi- 
cal cycles. For example, initial surveys have suggested that 
leaf litter decomposes roughly twice as fast in logged and 
primary forests as it does in palm-oil plantations. Once 
scientists have isolated fragments in each of these systems, 
they will be able to track changes in the decomposition rates 
as edge effects take hold, then relate those changes to soil 
nutrient cycles and impacts on plant communities. 

“It’s about doing fundamental science to help us 
understand what happens to a forest when you change it 
in these ways,’ says Ken Norris, an environmental biolo- 
gist at the University of Reading, UK, and biodiversity 
theme leader for the UK National Environmental Research 
Council. “And if you are going to do ecosystems science, 
you need experiments outdoors at the scale at which these 
systems work? 

Surveying the ongoing work, Lovejoy is pleased to see 


that his crazy idea still has legs. The project in Borneo will 
advance the field, perhaps leading to new collaborations 
and comparative studies with researchers at his own exper- 
iment in the Amazon. “I started out thinking this would 
be a 20-year experiment, and then I would get the answer 
and we would be done,” he says, gazing into the rain one 
afternoon at Camp 41. “It turned out to be more complex. 
So many new questions arose.” 

In the end, he says that much of the project’s impact — 
and his own — has come from broader educational and 
advocacy efforts that have advanced conservation initia- 


2 


a 


tives in Brazil and around the world. He notes that Brazil 
has now protected nearly half of its share of the Amazon, 
more than scientists like himself could have hoped for 
several decades ago. He also points to the hundreds of 
Brazilian scientists who have moved through the project, 
including people such as Rita Mesquita, a researcher at the 
INPA who first came to the project as a student in 1985 and 
rose up to become scientific coordinator and an important 
state environmental official. 

Lovejoy says that the scientific value of the Amazon 
fragment experiment only increases with time. Basic ques- 
tions about the rate of species extinctions in fragmented 
habitat still plague the field of ecology, and the fragments 
provide a unique way to explore the issue. Ultimately, he 
envisages raising enough money to buy the ranchland sur- 
rounding the project and then convert it into an educa- 
tional facility for science and ecotourism, and he has a new 
advisory board that is looking into options. 

“If we can get this place stabilized and institutionalized, 
we could do a lot of things,” he says. One way or another, 
he seems confident that the research will continue. “Even 
if my plane goes down on the way to Miami, I think it will 
happen.” 

Lovejoy falls silent, and the murmur of rainfall envelopes 
Camp 41. After a pause, he heads for the hammocks to join 
his guests for an afternoon nap. Soon enough, the rains 
will stop and Lovejoy will once again don his safari hat and 
binoculars, and set off to explore the fragmented forest. m 


Jeff Tollefson covers energy and environment for Nature 
from New York. 
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The Biological Dynamics 
of Forest Fragments 
Project in Brazil has 
experimental plots of 
primary forest ranging in 
size from 1 hectare to 
100 hectares located 
within cleared pastures. 
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Polio’s moving target 


Finding and vaccinating Nigerian nomads may be 
one of the last obstacles to the eradication of polio. 


ohammed Abubakar’s home is not 
M: any map — at least not yet. To 
reach his settlement in a desolate 
part of northern Nigeria, four health workers 
creep over deep-rutted roads in an old Peugeot 
for an hour, then ride motorcycles over narrow 
dirt trails for another 30 minutes — stopping 
only for the odd herd of cattle. Finally, they 
spot a cluster of mud-brick huts, known to the 
Fulani nomads who live there as a ruga. 
“As-salamu alaykum,; — peace be with you 
— says Ardo Babangida, a traditional leader 
accompanying the team. Children swarm 
around the visitors, and Daniel Santong, an 
easy-going veterinarian and leader of the group, 
asks to meet Abubakar, the head of the house- 
hold. Meanwhile, a young colleague whips out 
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a smart phone and uploads the settlement’s 
Global Positioning System coordinates into a 
database. Abubakar arrives, clad in a lavender 
tunic and white skull cap, and Santong tells 
him that they are trying to eliminate polio in 
nomadic people. Abubakar clasps his guest's 
hands in appreciation. He says that he cannot 
remember the last time that health workers 
came to vaccinate his children. It is a story that 
Santong and his colleagues are now accustomed 
to hearing, even though door-to-door immuni- 
zation campaigns happen 

on a near-monthly basis 


in the region. To watch a video 
These dusty paths are _ report on this 
the front lines of polio _ story, visit: 


eradication. A 25-year, 
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Vaccinators reach a 
child in Bauchi, Nigeria. 


US$10-billion global 
effort has taken the 
number of polio cases 
from hundreds of thousands per year to just 
hundreds, but it is now struggling to stamp the 
virus out of its final strongholds in Pakistan, 
Afghanistan and Nigeria, where transmission 
has never been interrupted. Of these, Nigeria 
was the only one to see an increase in cases 
from 2011 to 2012, and public-health experts 
worry that the virus’s recalcitrance here will 
prevent global eradication, and eventually lead 
to a wider resurgence of the disease. 

The barriers to polio eradication in Nigeria 
are complex and numerous. The country does 
not have a working public health-care system, 
and some local government officials are less 
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than committed to the cause. In urban centres 
in the north, widespread distrust of the govern- 
ment leads many parents to refuse vaccination 
for their children. What is more, in February 
several polio workers were murdered — for 
unknown reasons — at health clinics in Kano, 
northern Nigeria’s largest city. 

But epidemiologists have identified one 
barrier that might be overcome cheaply and 
safely: locating and counting remote popula- 
tions, including the nomadic livestock herders 
who drift through the region with the changing 
seasons. Records of their numbers and move- 
ments are incomplete, but the population is 
thought to include hundreds of thousands of 
young children, many of whom have received 
none or only some of the multiple oral polio 
vaccine doses required to achieve full protec- 
tion. Proponents of the programme say that 
nomads are a polio reservoir, spreading disease 
around the country during their migrations. 
So in June 2012, the National Stop Transmis- 
sion of Polio (N-STOP) programme, organized 
through the Global Polio Eradication Initiative 
(GPEI) and supported by the Nigerian govern- 
ment, started a census of Fulani nomads and 
other hard-to-reach populations, as part of a 
global emergency action plan against polio. 

“Until we solve the problem of these 
unvaccinated nomads, we’ not going to fix 
polio,” says Frank Mahoney, a veteran field 
epidemiologist leading the project from Abuja. 
“We're not going to be able to eradicate it” 


ROADBLOCKS AHEAD 

On a scorching, cloudless December day, a 
procession of several hundred livestock has 
taken over a road running between the states of 
Bauchi and Kaduna in northern Nigeria. Men 
walk alongside their cows and sheep, while 
women and children creep along on motor- 
cycles. It is the height of the dry season, the land- 
scape is parched, and the group is heading south 
to graze its livestock. Lere, a local government 
area (LGA) in Kaduna is travelled by a sizeable 
population of Fulani pastoralists like these dur- 
ing their biannual migrations. It is also one of 
dozens of ‘high-risk LGAs’ on which N-STOP 
teams are focusing their efforts. No polio cases 
have yet been detected here, but Lere is not far 
from the borders of Kano and Bauchi, which 
both recorded cases in the past year. 

Mobile and remote populations are often 
strongholds for disease. Somali nomads con- 
tained some of the final cases of smallpox, and 
the vaccination of herds in remote patches of 
east Africa was crucial to eradicating the cattle 
disease rinderpest, completed in 2011. 

In Nigeria, Fulani nomads receive little 
education or health care from the government. 
“Nobody looks after them, nobody takes pri- 
mary health care to them, nobody remembers 
they exist,” says Endie Waziri, a member of one 
of the N-STOP teams. When polio vaccination 
workers do visit their remote settlements, she 
says, they tend to visit only the first ruga they 
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Many Fulani migrate twice a year, making it difficult for vaccination programmes to find them. 


see and not look for others in the vicinity. 
Nigeria made significant headway against 
polio after starting an eradication programme 
in 1996. But those gains were erased in 2003, 
when Muslim clerics in the northern state of 
Kano called for a boycott of the polio vaccine 
over fears that the eradication campaign was a 
Western conspiracy to sterilize the population. 
Soon, Kano, Kaduna and other northern states 
had halted all polio vaccination campaigns. 
The boycott ended a year later, but by then 
polio had exploded across northern Nigeria 
and started to seep into nearby countries, such 
as Cameroon and Cote d'Ivoire, that had previ- 
ously vanquished the virus. Nigeria has since 
made lurching progress against the disease. 
Cases fell from more than 1,000 in 2006 to 21 


“If we hit these areas, we get a much bigger 
bang for the buck,’ says Chima Ohuabunwo, 
a Nigerian epidemiologist who took a sabbati- 
cal from his position at Morehouse School of 
Medicine in Atlanta, Georgia, to serve as the 
field coordinator of the nomads project. 

Still, the lack ofa working public health-care 
system has been a problem, says Heymann, 
who points out that neighbouring countries 
with large nomadic populations such as Chad 
have successfully interrupted transmission. 
“Other countries with migrant populations 
have done the job,’ he says. India, for instance, 
offered polio vaccinations at train stations to 
catch migrants, and Chad offers combined 
veterinary and polio vaccination services to 
encourage Fulani nomads — who depend on 


“Other countries with migrant 
populations have done the job.” 


in 2010, before rising again to 122 last year. 
Many more cases probably went undetected. 

Religious opposition to vaccines among 
settled populations has now given way to 
refusals driven by disenchantment. “People 
want things other than polio vaccination,” 
says David Heymann, chairman of the advi- 
sory board for Public Health England and the 
former head of the polio-eradication efforts 
for the World Health Organization (WHO). 
“They can't understand why people are coming 
once a month to give them vaccination when 
what they want are treatments for their chil- 
dren with fever or diarrhoea.” 

The nomads, however, rarely refuse polio 
vaccination for their children, and they are eager 
to receive other health and veterinary services. 
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the health of their livestock — to take part. 

N-STOP’s census programme was designed 
to support on-going vaccination efforts, 
uncovering areas of need and directing 
resources and local vaccination teams to them. 
In fact, so as not to interfere with local efforts, 
N-STOP teams didn’t bring vaccine stocks 
with them until the federal government asked 
them to. Since August, the N-STOP surveys 
have uncovered more than 32,000 settlements 
and identified more than 700,000 children 
— nearly 40,000 of whom had never been 
vaccinated against polio. 

Although little more than 3% of the 122 polio 
cases reported last year in Nigeria occurred in 
nomad children, the teams discovered more 
than 100 probable cases that went unreported, 


18 APRIL 2013 | VOL 496 | NATURE | 291 


| NEWS FEATURE 


Last holdouts ‘9 = 


In 2012, five countries worldwide 
reported cases of wild polio virus { 
(as opposed to cases relatedtoa ., » 
vaccine strain). Sustained ; 
transmission continues only in 
Pakistan, Afghanistan, Nigeria 
and until recently, India. Only 
Nigeria saw the number of cases 
rise between 2011 and 2012. 
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supporting the idea that nomads form an 
important link in the chain of transmission. 
When not on long migrations, Fulani nomads 
interact with other people at markets. More 
than one-third of confirmed polio cases in 2012 
were among children who lived in close prox- 
imity to nomadic communities. And nomad 
migration routes are hotspots of low vaccina- 
tion coverage, according to data from the US 
Centers for Disease Control and Prevention 
(CDC) in Atlanta, Georgia. The surveys are 
“really a strategy that’s been missing in the tool 
box fora long time, and it needs to be urgently 
done’, says Mahoney. But Mahoney also notes 
that the project is a work in progress. Emma- 
nuel Musa, the WHO coordinator in Lere, is 
not sure that it will be feasible for his district's 
vaccination team to reach all the nomad com- 
munities that N-STOP teams are uncovering. 
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Mistrust of the government 
in Nigeria’s northern states 

leads to high rates of ( 
vaccine refusal, and 
nomadic children are often 
missed by vaccinators. 
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explain some of the tensions. The public- 
private effort, which includes the WHO, CDC, 
the United Nations Children Fund (UNICEF), 
Rotary International and the Bill & Melinda 
Gates Foundation, has become the world’s 
costliest public-health initiative, and one of the 
longest running. Since it started in 1988, it has 
missed three deadlines for halting transmis- 
sion — in 2000, 2005 and 2012 — and it now 
burns roughly $1 billion per year chasing the 
last remaining pockets of disease. 


CAUSE FOR OPTIMISM 

Organizers are optimistic about winning the 
war, however. India was long thought of as 
the Waterloo of the initiative because its high 
population density and poor sanitation pro- 
vided ideal conditions for the virus to spread. 
But it celebrated its second year without a 


“Tf we hit these areas, we get amuch 
bigger bang for the buck.” 


“There are inadequate funds,” he says. 
Atacatch-up in December, Fureratu Zakari, 
the WHO coordinator for Kaduna state, also 
questioned whether the mapping data will 
be enough to allow health workers to locate 
nomadic communities. “They should be show- 
ing teams where settlements are and not just 
writing it down,’ she tells Olaniran Alabi, the 
programme's field coordinator for Kaduna. 
Zakari complains that the N-STOP pro- 
gramme isn't coordinating its day-to-day activ- 
ities with local health authorities, which could 
result in confusion and duplicated efforts. 
The sheer scale of the GPE] initiative may 


292 | NATURE | VOL 496 | 18 APRIL 2013 


case of polio in January. And Pakistan and 
Afghanistan both saw significant reductions 
in cases between 2011 and 2012 (see ‘Last hold- 
outs’), despite deep security challenges that 
limited the reach of vaccination campaigns. 
Programme officials in Nigeria say that 
national and state governments are increas- 
ingly committed to eradication, but that com- 
placency and corruption are still common 
among the officials in charge of local immu- 
nization campaigns. In October 2012, for 
instance, dozens of officials in Kano state were 
fired for treating the eradication programme 
as a “money-making venture’, in the words of 
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India, now officially polio 
free, last recorded a case 
in January 2011. 
Neighbouring Pakistan 
and Afghanistan still serve 
as reservoirs for outbreaks. 
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the state’s governor, Rabiu Musa Kwankwaso. 

Counting and vaccinating nomads will not 
solve all Nigeria's polio troubles, but it is easier 
to achieve than tackling domestic terrorism, 
vaccine refusal and other challenges. 

When asked how N-STOP will measure the 
success of the nomad programme, Mahoney 
puts it simply: “Stopping polio transmission. 
That’s the big indicator.” But he also points 
to the thousands of settlements and children 
that have been mapped and counted so far. 

As Nature went to press, Nigeria had recorded 
11 cases of polio this year; it had 17 in the same 
period last year. President Goodluck Jonathan 
has vowed to bring that number to zero before 
his term ends in 2015. Michael Galway, senior 
programme officer at the Bill & Melinda Gates 
Foundation in Seattle, Washington, is optimis- 
tic that the push to reach remote populations is 
paying off. “The work of the nomads project has 
been extremely beneficial in opening the eyes 
of the [polio eradication] programme to this 
additional piece of the puzzle,’ he says. 

The Nigerian project could hold lessons for 
future efforts to eradicate disease (measles is 
on some agendas), which are likely to hinge 
on reaching mobile and remote populations. 

Targeting vaccination efforts on nomadic 
children is the right strategy, says Paul Rutter, 
a spokesman for the independent monitoring 
board set up to evaluate global efforts at polio 
eradication in 2010. But it will be the people 
behind it who ultimately dictate its success. 
“This kind of dogged determination to reach 
every last child will be what rids Nigeria of 
polio?” m 


Ewen Callaway writes for Nature from 
London. 


SOURCE: GLOBAL POLIO ERADICATION INITIATIVE 


RON Nf 
We must not forget E. O. Wilson’s Show of = 4 ks ah. What sorts of 
the lessons of Willi Hennig, pragmatic, passionate advice rare treatises charts Si 


father of cladistics p.295 


to young scientists p.297 


Ip 


UK-DMC2/DMC INTERNATIONAL IMAGING 


history of anaesthesia p.298 “Y 


( 


Choose satellites to 
monitor deforestation 


Mlegal logging threatens tropical forests and carbon stocks. Governments must 
work together to build an early warning system, say Jim Lynch and colleagues. 


ropical deforestation contributes 

12% of total anthropogenic carbon 

dioxide emissions globally’. Illegal 
logging is costing nations tens of billions of 
dollars each year. Although governments are 
making headway on agreements to stop this 
destruction, so far there is no coherent plan 
to monitor tropical forests on the scale or 
timescales necessary to do so. 

Incentives are being negotiated for 
states to implement the United Nations 
REDD+ framework: Reducing Emissions 
from Deforestation and Forest Degrada- 
tion, extended to include conservation, 
sustainable management of forests and the 


enhancement of forest carbon stocks. The 
Intergovernmental Panel on Climate Change 
(IPCC) is also developing forest remote- 
sensing plans for consideration by the 
19th Conference of the Parties (COP-19) to 
the United Nations Framework Convention 
on Climate Change (UNFCCC), which will 
be held in Warsaw this November. 

Satellites provide the only means of viewing 
vast forest areas regularly — the tropics cover 
almost half of Earth's land area. But basic deci- 
sions have yet to be made on which Earth- 
observing systems should be used and how 
forest data should be monitored, reported and 
verified. In our view, the mapping strategies 
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proposed so far are too sparse and slow, mak- 
ing it impossible to identify forest damage 
until at least a year later. 

We believe that an early warning system 
is needed to allow authorities to stop illegal 
logging quickly. Two strategies are necessary 
to achieve this: first, a new set of tropical 
orbiting radar satellites that can ‘see’ through 
clouds to monitor global forests daily; sec- 
ond, a plan for existing satellites to assess 
forest carbon stocks several times a year, to 
account for seasonal variations. 

The REDD+ working group, which 
meets in Bonn, Germany, from 29 April to 
3 May, must agree on a comprehensive, 
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> rapid-response global monitoring system 
in time for it to be ratified at COP-19. 


REAL-TIME MONITORING 

More than one billion people in the tropics 
depend on forests for their livelihood. The 
illegal logging trade is estimated to be worth 
between US$30 billion and $100 billion annu- 
ally, with governments losing $10 billion each 
year in tax income’. Stolen wood is estimated 
to depress world timber prices by up to 16%. 

As well as curbing these losses, imple- 
menting REDD+ and halving deforestation 
rates by 2050 would prevent the release of 
50 petagrams of carbon in total into the 
atmosphere”, saving a potential warming 
of 0.14°C. This is a small but crucial step 
towards potentially much larger carbon 
savings. 

Implementing REDD+ will require many 
facets of the carbon cycle to be monitored, 
including tree cover and carbon emissions 
from felling. There is also a need to assess 
the state and health of forests by analysing 
colour changes due to chlorophyll loss. Bio- 
mass, dead organic matter and soils must be 
included in the carbon budget. 

Because different types of forest vary in 
their capacity to store carbon, changes in 
land use must be recognized. For example, 
Indonesia's policy of converting its 91 mil- 
lion hectares of tropical forest to oil palm 
plantations reduces the land’s value from 
a REDD+ perspective, because much less 
carbon is being stored. Around 72% of the 
frontier forest has already been lost. 

Optical and radar satellites working in dif- 
ferent parts of the electromagnetic spectrum 
can monitor all of these characteristics. But 
international bodies and governments have 
been slow to formulate and agree on common 
guidelines for so doing. Without the right 
technologies or globally accepted assessment 
standards, billions of dollars could be wasted 
on projects that do not deliver. 

The European Union has focused on 
preventing illegal logging, rather than 
specifying how forestry information should 
be collected by countries signed up to its 
2005 Forest Law Enforcement, Govern- 
ance and Trade regulation. This has led to 
a proliferation of incomparable collection 
methods. Moreover, there is no plan for 
REDD+ carbon credits to be accepted in the 
European Union Emissions Trading Scheme. 

Some groups have published guidelines 
for satellite monitoring for REDD+, includ- 
ing the UN Food and Agriculture Organiza- 
tion’s Global Observation of Forest and Land 
Cover Dynamics programme and the Group 
on Earth Observations’ Global Forest Obser- 
vations Initiative. In our view, these reports 
lack ambition and an understanding of the 
potential of satellites. Another group, the 
Committee on Earth Observation Satellites, 
for example, foresees doing only one optical 
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Illegal logging in Indonesia costs billions. 


image survey a year — insufficient to track 
seasonal variations in forest carbon stocks. 

To achieve good annual coverage, optical 
measurements should be taken at least every 
1-2 weeks. To spot illegal logging, coverage 
needs to be even more frequent. Observa- 
tions must be made at least daily, with data 
analysed much more quickly than at the 
current rate of months. If we are to protect 
tropical forests, we must go beyond map- 
ping damage long after it has happened and 
instead offer early warnings. 


WAY FORWARD 

Governments and the IPCC need to accept 
that satellites are the only efficient and realis- 
tic way to provide monitoring for REDD+**. 
This must be enshrined in international law 
through the UNFCCC process and docu- 
mented in detail. Governments must com- 
mit to securing and maintaining systems of 
Earth-observing satellites to perform the 
monitoring. 

Remote-sensing satellites operate in two 
spectral regimes: optical and radar. Optical 
sensors in different colours are sensitive to 
vegetation greenness, fractional tree cover, 
forest type and vegetation density. They 
can cover millions of square kilometres 
in a single image while resolving detail to 
20 metres, or down to 1 metre over a smaller 
area. Current Earth-observation efforts 
include NASAs Terra and Aqua satellites, the 
China-Brazil Earth Resources Satellite pro- 
gramme, the Disaster Monitoring Constel- 
lation (DMC) and France’s SPOT satellite. 

Radar systems, unlike optical systems, 
can penetrate cloud cover, which is com- 
mon in the tropics. The microwave signals 
transmitted by Synthetic Aperture Radar 
(SAR) satellites, such as TerraSAR-X, are 
reflected back by leaves, branches and 
trunks to the satellite's receivers. Such satel- 
lites have historically been expensive, costing 
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between £250 million (US$384 million) and 
£500 million each in orbit. Fortuitously, a 
new generation of lower-cost radar mini- 
satellites is due to be launched within 
the next year. These include the British 
NovaSAR-S range (£45 million launched and 
insured). The European Sentinel-1 mission 
and the Japanese ALOS-2 satellite each cost 
about ten times this. 

Radar and optical systems are both 
needed. An early warning constellation of 
five tropical-orbiting radar satellites should 
provide daily scanning of tropical forests 
at 5-20-metre resolution, irrespective of 
weather, to monitor logging in real time. 
We estimate that this could be done for 
£200 million, plus a dedicated ground crew. 
Ideally, these data should be managed by an 
international consortium that includes devel- 
oped and developing countries, and would 
be run under the auspices of the UNFCCC. 

Low-resolution optical satellites can be 
constructed cheaply to monitor forests weekly 
or monthly throughout the year’. Existing 
satellite systems, including NASA’ MODIS, 
DMC, SPOT or Landsat, could contribute, 
with much of the data made available for free. 

But to provide regular ground data with 
which to calibrate all these satellite observa- 
tions and reduce uncertainties in their carbon 
estimates, major efforts will also be required 
by those countries participating in REDD+. 

As discussions continue this year, we urge 
policy-makers to back the right satellites and 
strategies to monitor and save the world’s 
forests. m 
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GERD HENNIG 


Heed the father 
of cladistics 


The way Willi Hennig discovered evolutionary 
relationships should not be forgotten, say Quentin 
Wheeler, Leandro Assis and Olivier Rieppel. 


Darwin proclaimed that “our classifica- 

tions will come to be, as far as they can 
be so made, genealogies”. That turned out 
to be easier said than done. Even as late as 
the 1970s, biologists were still grouping 
animals and plants largely on the basis of 
overall physical similarity and whether they 
possessed or lacked certain traits, such as a 
backbone or the ability to produce flowers. 

The German entomologist and palae- 
ontologist Willi Hennig transformed the 
classification of organisms into the rigorous 
science of cladistics’ *. His book Phylogenetic 
Systematics’, published in 1966, laid out how 
to construct phylogenetic trees and how to 
use their branching patterns as the basis for 
classifications. 

Paired with DNA sequencing, Hennig’s 
theories revolutionized our understanding 
of the relationships among the nearly two 
million species known today. In the history 


E On the Origin of Species, Charles 


of biological classification, the little-known 
Hennig arguably deserves a place alongside 
Aristotle, Carl Linnaeus and Darwin. 

But two key messages from his book have 
been lost in the nearly half-century since it 
was published: the importance of detailed 
studies of the development and evolution 
of complex characters, such as the horn of 
a rhinoceros or the pincer of a fiddler crab; 
and the use of all relevant evidence — molec- 
ular, anatomical, fossil and developmental 
— in mapping evolutionary relationships. 
Too often these days, DNA information is 
favoured over everything else, and when 
conflicts arise between DNA-based analy- 
ses and those reliant on morphology, the 
former is frequently assumed to be correct, 
even though many uncertainties surround 
the molecular basis of evolution. 

Hennig was born 100 years ago this week, 
on 20 April 1913. In celebration of his impact 
on phylogenetics and classification, we urge 
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biologists to heed his call to embrace all the 
relevant data. 


BIOLOGICAL BEGINNINGS 

Hennig, who died in 1976, was born in the 
village of Diirrhennersdorf in Germany’. 
He worked as a volunteer at the Dresden 
Museum, where he became fascinated by 
the extraordinary diversity of flies, and later 
as a researcher at the German Entomological 
Institute in Berlin until he was conscripted 
when the Second World War broke out in 
1939. Badly injured on the Russian front, 
Hennig ended up serving in the German 
military medical services after his recovery. 

Germany surrendered in May 1945 and 
Hennig, along with other German soldiers 
in Italy, came under Allied control. Recog- 
nizing his entomological expertise, the Brit- 
ish assigned him to their malaria research 
unit, where he remained until his release 
in October of that year. While in this unit, 
Hennig produced a handwritten draft of his 
1950 work, Grundziige, the forerunner of his 
1966 book, based in part on his observations 
of insect evolution. 

From 1947, Hennig commuted for an 
hour-and-a-half each way from his home 
in West Berlin to the German Entomologi- 
cal Institute in East Berlin. The erection of 
the Berlin Wall in 1961 halted his com- 
mute, as well as plans to make him director 
of the institute. In 1963, he became head 
of a new department for phylogenetic 
research at the State Museum of Natural 
History in Stuttgart. 

Hennig received numerous awards in his 
lifetime for succeeding where generations of 
natural historians had failed, including gold 
medals from the Linnean Society of London 
and the American Museum of Natural His- 
tory. Yet the remarkable achievements of this 
quiet, unassuming man are mainly appreci- 
ated only within systematic biology. That 
cladistics is widely used today is thanks in 
part to Hennig’s American “bulldogs, who 
promoted his ideas: Norman Platnick, a spi- 
der expert, Gareth Nelson, a fish expert, and 
James Farris, a theorist in phylogenetics’. 

Before the 1970s, many biologists viewed 
phylogenetic trees and classifications with 
suspicion, and for good reason. The con- 
struction of trees was often based more ona 
hunch than on testable hypotheses, and clas- 
sifications did not strictly reflect branching 
patterns. The class Reptilia, for example, 
included crocodiles but excluded birds, even 
though the anatomical evidence strongly 
suggested that crocodiles and birds share a 
common ancestor. Meanwhile, all that held 
the ‘invertebrates’ together was their lack of 
a backbone. Like Darwin, Hennig believed 
that groupings of species could and should 
be strictly genealogical. He reasoned that 
such phylogenetic classifications could have 
the same organizing function in biology > 
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> that the periodic table of elements had 
had in chemistry. 

This led him to define a ‘monophyletic 
group as one that included an ancestral 
species and all (and only) its descendant spe- 
cies. He also insisted that all the members 
of a monophyletic group share at least one 
evolutionary novelty that arose in the com- 
mon ancestor, such as wings in insects. Until 
this point, monophyly loosely meant sharing 
a common ancestor, whether or not all the 
descendants were included. Using Hennig’s 
approach, Reptilia could be considered a 
monophyletic group only if it included 
birds. At the time, his proposal was severely 
criticized by several prominent biologists, 
including Ernst Mayr, yet this use of mono- 
phyly is now standard practice. 

A great challenge for systematic biologists 
is to work out where in a genealogy an evo- 
lutionary novelty first appears and at what 
level on the phylogenetic tree it is informa- 
tive. The fact that an organism has six legs, 
for instance, indicates that it belongs to the 
group Hexapoda (insects and their near rela- 
tives), but says nothing about whether it is 
a fly (order: Diptera). Hennig emphasized 
that every evolutionary novelty that arose 
in acommon ancestor, either in its original 
or modified form, is potentially informative 
at the appropriate phylogenetic level, and 
advocated the use of all available data — 
from biochemical pathways to skeletal struc- 
tures and genetically controlled behaviours. 
In fact, he argued that living species provide 
a far richer source of historical information 


Pa - 
Willi Hennig stressed that every evolutionary novelty, from a spider’s web-making ability or a giraffe’s 
neck to a fiddler crab’s pincer, is potentially informative in mapping evolutionary relationships. 
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than fossils, in which only a small fraction of 
features are preserved. 


HARD TRUTH 

Hennig never implied that fossil evidence 
should be ignored. But his take on fossils 
surprised many contemporary palaeontolo- 
gists and evolutionary biologists. In the early 
twentieth century, and to a lesser extent, even 
while Hennig was writing his book, the fossil 
record was treated as if it revealed the truth of 
evolutionary history. We think that — as Nel- 
son* argued almost a decade ago — DNA has 
similarly come to be 


wrongly viewedas “DNA has come 
the key to phylog- tobe wrongly 
eny, rather thanjust viewed as the 
one among several key to phylogeny, 
sources ofevidence. vatherthan 

DNA sequencing just one among 
has clear advan- seyeral sources 


tages: vast amounts 
of data can be 
collected quickly and cheaply, and sequences 
seem to offer a more objective measure 
than assessments of complex morphologi- 
cal features. But DNA analyses also involve 
subjective judgements. For instance, the 
assumptions used to produce models of 
molecular evolution, on which such analy- 
ses are based, may differ. And when conflicts 
arise between DNA analyses and those based 
on morphology, fossils or ontogeny, there is 
no theoretical justification for favouring one 
source of data over another. 

Take the relationships among the 


of evidence.” 
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some 9,000 scaly reptile species. Iguanas, 
chameleons and their close relatives had 
long been placed in the lower branches of the 
phylogenetic tree, but various DNA studies 
conducted over the past decade have sug- 
gested that they are actually higher up, near 
snakes and Gila monsters (a venomous liz- 
ard species). A recent analysis of more than 
600 morphological characters now indicates 
that the traditional placement of iguanas in 
the tree is correct after all”. The original phy- 
logeny certainly seems more plausible; for the 
DNA-based branching pattern to be correct, 
an extraordinarily large number of complex 
forms would have had to have independently 
evolved back into an ancestral state. 

Furthermore, used in isolation, DNA- 
sequencing technologies cannot fully 
explain the characteristics, history and ori- 
gins of complex characters, such as xylem 
vessels, seeds, flowers or feathers, that have 
arisen over millions of years. It is hard to see 
how biologists can understand the many 
ways in which organisms have adapted to 
their environments without analysing their 
physical adaptations. 

The progress in exploring and classifying 
biodiversity that Hennig’s work unleashed 
is just the beginning: biologists estimate 
that eight million to ten million plant and 
animal species remain unknown to sci- 
ence, and possibly even more microbes”. 
To understand the details of evolutionary 
history that help to explain this incredible 
diversity, biologists should remember Hen- 
nig’s message: the clues are in evolutionary 
innovations at all levels, molecular, anatomi- 
cal, developmental and behavioural. m 
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E. O. Wilson studying ant behaviour in 1975. 


A guide to the life 


scientific 


Stuart Pimm applauds eminent biologist E. O. Wilson’s 
pragmatic and passionate career advice. 


y mind races through possible 
conversations as I see my phone 
call is from E. O. Wilson. Pulitzer 


Prize winner, author of Sociobiology, laure- 
ate of the top international prizes for envi- 
ronmental sciences, widely acknowledged 
as one of the great scientists of our age, 
emeritus professor still mounting incisive 
attacks on those who challenge his ideas 
on evolution: what could he want to dis- 
cuss with me? “Ants” I told myself quickly, 
organisms about which I know nothing. He 
did not disappoint, grilling me about my 
experiences in the Hawaiian islands, where 
ecosystems assembled without ants. 

I expected Letters to a Young Scientist to 
be about ants, too. It is. Ants appear every 
few pages in its 21 short chapters. Advice 
flows as Wilson identifies them, hunts down 
rare and evolutionarily special ones from Sri 
Lanka to the Australian outback, and makes 
his lab smell by putting chemicals on their 
bodies that trigger their ejection from nests 
as dead, even when they are not. They are 
the focus of global explorations and innu- 
merable simple experiments. 


In this fund of prac- 
tical and philosophi- 
cal guidance distilled 
from seven decades 
of experience, Wilson 
provides exactly the 
right mentoring for 
scientists of all disci- y 
plines — and all ages. 


Thisisavery personal Letters toa Young 

story, from boyhood Scientist 

Scout camp in Ala- EDWARD O. WILSON 
Liveright/W. W. 


bama to Harvard 
University, replete 
with autobiographical 
details, stories of the famous, the eccentric 
(and usually both), of success and, far more 
often, of failure and frustration. 

Wilson takes on that “great bugbear for 
many would-be scientists” — mathemat- 
ics — in Chapter 2. He grants that some 
disciplines require exceptional fluency, but 
warns that a strong mathematical back- 
ground does not guarantee success. He con- 
jures a field of dreams. Build engaging ideas 
and the mathematicians will come. “For 
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every scientist ... there exists a discipline of 
science for which that level of mathemati- 
cal competence is enough to achieve excel- 
lence,’ Wilson posits as a principle. 

Nor is exceptional IQ essential. He 
writes: “let me suggest ... that many of the 
IQ brightest join ... MENSA and work as 
auditors and tax consultants.” Yes, it helps 
to be smart, but one can have too much ofa 
good thing. Wilson thinks that those who 
are too smart don't have to sweat to do sci- 
ence as students, something that he and 
I agree is necessary to build the requisite 
character. 

There are admonitions for later stages in 
careers. “Avoid department-level admin- 
istration.” “Make excuses, dodge, plead, 
trade.” “Spend extra time with students 
who show talent and interest in your field of 
research.” “Consider carefully job offers that 
include fewer ... administrative responsibil- 
ities” This book will quickly be on the Index 
Librorum Prohibitorum of college deans. 

Wilson sees daydreaming as a key exer- 
cise for success, and suggests: “Make talking 
to yourself silently a relaxing pastime.” My 
personal experience suggests that silence is 
not an essential ingredient of that recipe, 
although my wife would appreciate it if 
it were. He also urges his putative young 
scientist to explore. He won the coveted 
Explorers Club Medal, despite never ven- 
turing to the poles or making first contact 
with an Amazonian tribe. But his exploring 
for biodiversity has made many aware that 
scientific frontiers are all around us. 

Can creativity be engineered? Wilson 
visits the Santa Fe Institute in New Mexico, 
where personal office space is deemed 
distasteful, and Google’s headquarters in 
Mountain View, California. When I visited, 
I tried not to overindulge in ad libitum soft 
drinks, coffee and nuts. And I still haven't 
got over the advice on writing better com- 
puter code pasted above the men’s urinal. 
These places impress Wilson too. Nonethe- 
less, he believes that the creative process 
“arises and for a while generates in a soli- 
tary brain”. 

“You will make mistakes. Try not to make 
big ones,’ he concludes. Amen to that. Yet 
Wilson begins the book with the most 
important advice of all: “enduring passion 
will never fail you.” It hasr’t failed him. 

This is no pompous, deeply philosophical 
treatise on how great ideas develop. Wilson 
shares his simple love for ants and their 
natural history, revelling in them without 
hesitation. Everything else follows. m 
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The pituri plant has long been known for its narcotic properties. 


Feeling no pain 


John Carmody enjoys an exhibition that charts the 
trajectory of anaesthesia from its botanical beginnings. 


crass modern summary of academic 

career-building. Without the writ- 
ten or printed word, work can be forgotten 
for centuries — or even forever. 

This message is beautifully underlined in 
History of Anaesthesia, an exhibition on pain 
relief at the University of Sydney, Australia, 
that draws on the institution’s splendid Rare 
Books and Special Collections, as well as 
the equipment museum maintained by the 
Australian Society of Anaesthetists. Through 
some 200 books, official reports, quaint and 
serious apparatus, and illustrations spanning 
more than five centuries, the show explores 

the age-old quest to 


cc Pes or perish” is more than a 


> NATURE.COM dull agony. 

For more on the The exhibits are as 
science ofpain,see: © much a reminder of 
go.nature.com/npSpcy this great aspiration 
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as of the panoply of _ History of 

methods — exotic, Anaesthesia 

futile and pragmatic Fisher Library, 

— designed to fulfil it, UV" of Syaney, 
ae 8 “Australia. 

But it is the books that Until 12 May 201 3 

best reveal the think- 


ing of our intellectual forebears. 

Because almost all that is ‘physic is derived 
from plants — the pain-relieving potions 
concocted from, say, the opium poppy — 
the exhibition begins with medical botany 
as systematized in early ‘herbals. Perhaps 
the most precious is a book translated from 
Latin by the English surgeon John Halle. The 
herball: an expositive worke after the order 
of the alphabet, published in 1565, is actu- 
ally an extract from Lanfranco of Milan’s 
seminal thirteenth-century surgical treatise 
Chirurgia parva. 

Halle, an Elizabethan progressive, 
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championed sourcebooks in the vernacular 
to promote learning. An important figure as 
modern medicine began to emerge in Eng- 
land, he summarized his aim as “the edifica- 
tion and building up of good science, and 
to the subvertion of all haters and abusers 
of the same”. 

Among other traditions of botanical med- 
icine, an intriguing Australian example is the 
little paper “Pituri and Duboisia’, which the 
parasitologist and surgeon Joseph Bancroft 
read to the Queensland Philosophical Soci- 
ety in 1877. Bancroft refers to the pituri plant 
(Duboisia hopwoodii), widely used by indig- 
enous Central Australians as “a stimulating 
narcotic”. 

The renowned German botanist Ferdi- 
nand von Mueller, who settled in Melbourne, 
identified Bancroft’s samples and surmised 
that Duboisia’s properties might be similar 
to the narcotic effect of stramonium, an 
alkaloid extracted from a plant of the night- 
shade family and used for centuries to relieve 
asthma and pain. We learn from the paper 
that Bancroft, testing it on his domestic 
pets, found the effects “strange’, adding that 
“they seem blind, or nearly so, with a widely- 
dilated pupil”. Afterwards, while dosing sev- 
eral of his ophthalmic cases, he “found an 
action of great rapidity” and quickly alerted 
receptive colleagues elsewhere in Australia. 
The country’s plantations of Duboisia spe- 
cies and hybrids still supply around 70% of 
the global stocks of the alkaloids scopola- 
mine and atropine. 

As chemistry developed, the active con- 
stituents were purified — crucially for 
anaesthesia, which was perhaps the most 
pharmacologically dependent medical spe- 
ciality. Equally important were fundamen- 
tal discoveries about atmospheric gases and 
respiration — both crucial to inhalational 
anaesthesia. Here, the German-Swedish 
chemist Carl Wilhelm Scheele’s name should 
be pre-eminent: a 1780 English translation of 
his Chemical Observations and Experiments 
on Air and Fire is rightly part of the exhibi- 
tion. He would be better remembered had 
he not been usurped by Antoine Lavoisier, 
whom he told how to prepare oxygen (ina 
letter made public in Paris in 1992). 

Beyond revelling in the beauty and nostal- 
gia of remarkable old books, this exhibition 
reminds us that perhaps no area of medicine 
relies more than anaesthesia on a diversity of 
scientific understanding. The modern man- 
agement of pain — the “perfect misery, the 
worst of evils’, as John Milton described it 
in Paradise Lost — has been an intellectual 
venture. m 
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The word on popular science 


To mark UNESCO’s World Book and Copyright Day on 23 April, Josie Glausiusz asks 
science editors at leading book publishers about trends and technology. 


nimated, interactive, and evolving at 
A reader’s command: this is the pop- 
ular science book of the near future, 
according to T. J. Kelleher, the science com- 
missioning editor at Basic Books in New York. 
“We can expect to see new kinds of books 
that would have been impossible on paper,’ 
Kelleher says. “The straightforward book that 
is nothing but words is not going any place; 
people obviously like to read, but there are 
other ways they can experience information” 
Soa mathematics e-book might incorporate 
a user interface that lets readers change vari- 
ables: for example, the density, volume and 
speed ofan asteroid hurtling towards Earth. 
Basic Books has already taken the plunge. 
In October 2012, the company published 
a version of Richard Feynman’s Six Easy 
Pieces for the iPad that incorporates audio 
recordings of Feynman’s physics lectures 
at the California Institute of Technology in 
Pasadena — and 5,000 photographs of the 
revered Nobel laureate delivering them. 
My poll of science editors at major pub- 
lishers revealed enthusiasm for apps and 
e-books. In September 2011, Farrar, Straus 
and Giroux (FSG), in a joint imprint with 
Scientific American, launched its first app. 
Journey to the Exoplanets has more than 


SCIENCE BOOK SALES 


100 illustrations, as well as essays and audio 
clips from planetary scientists, including 
Caleb Scharf and Lawrence Krauss. 

The electronic format, says Latha Menon, 
a senior commissioning editor at Oxford 
University Press in Oxford, UK, is “ideal 
for transmitting science ideas’, because ani- 
mations can be used to illustrate processes 
such as natural selection. In one example, 
she says, “you can actually see how the light- 
grey moths are picked off” in preference to 
dark moths camouflaged by soot-grimed 
tree trunks, an adaptation seen during the 
industrial revolution. 

And, notes FSG senior editor Amanda 
Moon, “the thirst for knowledge about topics 
that fall under the broad umbrella of ‘science’ 
is intensifying. We all have questions about 
evolution, technology, psychology, animal 
behaviour” Others echo her, although with 
none prepared to divulge exact sales figures, 
it is hard to know whether their enthusiasm 
reflects profits. 

The science books that find a devoted 
readership these days, says Moon, “boast 
provocative arguments grounded in origi- 
nal research, are character-driven narratives 
and, in many cases, offer counterintuitive 
angles on seemingly familiar topics”. She 
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cites Kevin Dutton’s The Wisdom of Psycho- 
paths (Scientific American, 2012), which 
suggests how we can learn from “functional 
psychopaths” such as some neurosurgeons 
and lawyers. Authors, she adds, now have 
many more opportunities to get the word out 
about groundbreaking ideas and research 
through platforms such as Facebook, Twit- 
ter and Goodreads. 

Many of today’s readers are prepared for a 
challenge. “What's interesting,” says Menon, 
“is that there seems to be a significant mar- 
ket of people out there who perhaps studied 
some science or even did a science degree’, 
and then went off to do something else. That 
readership, Menon adds, is sophisticated, 
demanding books that are “conceptually, a 
little stretching”: for example, Jim Baggott’s 
The Quantum Story (Oxford Univ. Press, 
2011), a history of quantum theory from 
1900 to the present in 40 key moments. Epi- 
genetics, neuroscience and particle physics 
— the last prompted by the discovery last 
year of a Higgs boson — are currently big, 
she says. Although, as in every other field, 
science book publishing is dominated by tel- 
evision tie-ins and celebrity authors such as 
Brian Cox and Stephen Hawking. 

Christie Henry, editorial director of 
sciences and social sciences at the Univer- 
sity of Chicago Press in Illinois, concurs that, 
at present, the public appetite for science 
seems voracious (see ‘Science book sales’). 
“The most pressing issues we face as a global 
society rely on science to some extent,’ she 
says. Books on water rights, infectious dis- 
eases, conservation and marine science are 
all proving popular, she explains. 

All agree that well-written science books 
are unlikely to disappear, even in an age in 
which the average person's eyes seem to 
be perpetually locked on mobile devices. 
“The quality of the writing and the quality 
of the narrative matter,” says Menon. Clas- 
sics such as Richard Dawkins’ The Selfish 
Gene (Oxford Univ. Press, 1976) and Primo 
Levi's 1975 The Periodic Table continue to 
sell well. They have, says Menon, “a literary 
quality, a structure and a beauty. There are 
just a few scientists who can write like that, 
and that is something to be treasured and 
valued. I would hope that that continues to 
have a home.” = 


Josie Glausiusz is a writer based in Israel. 
e-mail: josiegz@gmail.com 
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Overfishing in west 
Africa by EU vessels 


China is not the only culprit 
threatening the marine 
ecosystems of west Africa 
through overfishing (Nature 
496, 18; 2013). The region has 
been the fish basket of southern 
Europe since the 1960s, and the 
European Union (EU) is going to 
great lengths to renew its fishery 
agreements with African nations. 

These agreements are 
questionable because they 
promote the export of African 
marine resources into the EU to 
the detriment of local economies 
and food sovereignty. Morocco 
has allowed EU vessels into the 
national waters of neighbouring 
Western Sahara, effectively 
hijacking its fish stocks. 

The uncontrolled harvesting 
of pelagic fish and octopus by 
hundreds of foreign fishing 
vessels is having a deep ecological 
impact on the Atlantic Ocean's 
Canary Current upwelling marine 
ecosystem, a major biodiversity 
hotspot. It is also destroying vast 
amounts of by-catch. 

Research programmes are 
under way to quantify this 
collateral damage. Some highly 
vulnerable and endangered 
marine vertebrates are affected, 
including monk seals, several 
shark species, dolphins and 
sea turtles, and hundreds of 
thousands of Macaronesian and 
Palaearctic seabirds that breed 
and overwinter along the west 
African coast (see, for example, 
J. Zeeberg et al. Fish. Research 78, 
186-195; 2006). 

We therefore urge EU 
parliament members to rethink 
their African fishing policies, 
properly taking into account the 
devastating political, social and 
ecological impact of their large 
industrial fisheries. 

Raiil Ramos Centre for 
Functional and Evolutionary 
Ecology (CEFE), CNRS, 
UMR5175, Montpellier, France. 
raul.ramos@cefe.cnrs.fr 

David Grémillet CEFE-CNRS 
Montpellier, France; and PFIAO, 
DST/NBF Centre of Excellence, 


University of Cape Town, South 
Africa. 


Scale of global road 
map is impractical 


William Laurance and Andrew 
Balmford’s proposed global 
zoning map (Nature 495, 
308-309; 2013) for road planning 
would be too crude in scale to 

be useful for practical, on-the- 
ground decisions. 

Positive outcomes for 
biodiversity from road building 
are more likely to be achieved in 
a local context. Decisions about 
road developments are not made 
ona global scale, and rarely even 
on a national level. 

Any road planned with 
environmental and social 
considerations in mind is better 
than one designed without them. 
But decision-makers should not 
shift their focus too far from the 
major problems that proposed 
roads might create. 

Malgorzata Blicharska Swedish 
University of Agricultural 
Sciences, Uppsala, Sweden. 
malgorzata.blicharska@slu.se 


Uphold standards 
for lab reagents 


Scientists assume that reagents 
supplied commercially have been 
rigorously tested for purity. We 
have found that this assumption 
can be risky. 

Our laboratory purchased 
3 milligrams of a biomolecule 
from a reputable supplier to 
investigate its effect on nerve 
cells. Anomalous results 
forced us to test its purity. 
Using mass spectrometry and 
qualitative western blotting, we 
discovered unacceptable levels of 
contamination with several other 
biologically active molecules. 

Several research groups have 
published important mechanistic 
conclusions based on the activity 
of this molecule. Contamination 
with other active components 
would call these results into 
question. 
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Reagents that are misleadingly 
labelled waste researchers’ funds 
and time. Suppliers should 
declare the accurate composition 
and modes of analysis for every 
batch of their products. 

Sally Roberts, Heidi Fuller 
Keele University, Staffordshire, 
UK; and Robert Jones & Agnes 
Hunt Orthopaedic Hospital, 
Oswestry, UK. 
sally.roberts@rjah.nhs.uk 
Bruce Caterson Cardiff 
University, UK. 


Crowd-sourced soil 
data for Europe 


Since its launch by the UK 
Natural Environment Research 
Council in June 2012, the free 
iPhone app mySoil has attracted 
more than 2.6 million web hits 
and 12,500 dedicated users 

(see go.nature.com/tylukf). 
Next month, a new version will 
extend its coverage to soils in the 
European Union (EU). 

Users of mySoil can currently 
view soil maps of the United 
Kingdom (excluding Northern 
Ireland) that provide regional 
information on soil depth, 
texture, pH and organic-matter 
content, and on vegetation 
habitats. They can upload photos 
and descriptions of their local 
soils. More than 500 entries have 
so far contributed to this valuable 
data bank of soil properties in 
different localities. 

The remarkable success of 
collecting data in this way reflects 
widespread public interest and 
promotes communication with 
the science community. 

With infrastructure covering 
an estimated average of 9% of the 
EU — often spread across the 
most productive soils — and with 
sustainable land management 
becoming increasingly urgent, 
mySoil’s democratization of soil 
data is a timely development. 
Wayne Shelley, Russell Lawley 
British Geological Survey, 
Nottingham, UK. 

David A. Robinson Centre for 
Ecology & Hydrology, Bangor, UK. 
davi2@ceh.ac.uk 
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Chinese institute 
clarifies patent case 


As director and deputy director 
of the Institute of Microbiology, 
Chinese Academy of Sciences 
(IMCAS), we would like to 
clarify some points in your 
discussion of our involvement 
in an intellectual-property case 
(Nature 492, 323; 2012). 

Without our knowledge, 
Shandong Hilead Biotechnology 
in Laiyang — a recipient of the 
dodecanedioic acid technology 
invented by IMCAS, with which 
Hilead collaborated to further 
improve this technology — filed 
a patent application in 2010 
that listed both of us and five 
others as inventors (patent 
CN201010160266.4). 

When we discovered this, we 
demanded that our names be 
dropped from the application 
because we had not made any 
technical contribution to the 
patent. Hilead filed an application 
for a change of inventors listed 
on the patent on 25 January 2011 
to the State Intellectual Property 
Office of China, who approved 
the change on 14 February 2011 
(file 2011021000026950). So the 
removal of our names occurred a 
few months before two employees 
at Cathay Industrial Biotech in 
Shanghai filed a lawsuit over the 
patent in June 2011. 

Hilead officially apologized 
on 17 August 2011 ina letter 
to IMCAS for using our names 
without consent. The court 
decision on the lawsuit exempted 
us from civil liability and no 
apology was required from us 
(Beijing Higher People’s Court 
decision 3975, 16 November 2012). 
Li Huang, Shuang-Jiang Liu 
IMCAS, Beijing, China. 
huangl@im.ac.cn 


CORRECTION 

A declaration of competing 
financial interests was omitted 
from the Comment ‘A jump- 
start for electroceuticals’ 
(Nature 496, 159-161; 2013). 
This has been added online. 
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A potted history of Japan 


The discovery of lipids on ceramic fragments from the Japanese Jomon period provides the earliest evidence for the use of 
pottery for cooking and may prompt a rethink of some aspects of human innovation. SEE LETTER P.351 


SIMON KANER 


r | The invention of pottery is one of the 
great technological innovations in 
human history. The recognition 

that fire could be used to transform soft 

clay into impermeable ceramics repre- 
sented a cognitive shift in the way people 
engaged with the material world around 
them'. Until recently, archaeologists usu- 
ally regarded pottery as being associated 
with agriculture. Pottery usage among 
hunter-gatherers was considered somewhat 
anomalous and counter- intuitive; fragile pots 

did not seem to have a place in the mobile 

lifestyles thought to characterize most human 

existence before the advent of farming villages 
during the Neolithic, from about 10,000 years 
ago in the eastern Mediterranean. But the 
discovery of lipids on ceramic vessels in East 

Asia dating from the Late Pleistocene, about 

15,000-12,000 years ago, reported by Craig 

et al.” on page 351 of this issue, suggests that 
some hunter-gatherers used pots for cooking. 

The report also provides a demonstration of 

how science should be integral to our piecing 

together of history’. 

Craig and colleagues report the first recov- 
ery and analysis of lipids from charred deposits 
— the remains of prehistoric meals — stuck 
on the surfaces of potsherds of some of these 
ancient pottery containers. The authors’ 
description of the lipids, supported by stable- 
isotope analysis, indicates that Late Pleistocene 
peoples in what is now the Japanese archi- 
pelago were using pottery to cook fish, both 
freshwater and marine. With claims of pottery 
in East Asia extending back even further than 
this, perhaps as far as 20,000 years ago’, the 
application of rigorous scientific analyses to 
these materials is of great relevance to further- 
ing our understanding of the cultural contexts 
in which pottery was first made and used. 

Indications of Late Pleistocene pottery are 
not completely new, but little is known about 
what these vessels were used for. In the late 
1960s, thin ceramic potsherds associated with 
tiny obsidian stone blades, or microliths, were 
excavated at the Fukui Cave in southwestern 
Japan, dated to around 12,000 years ago’. 


*This article and the paper under discussion were 
published online on 10 April 2013. 
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Figure 1 | Fish soup from the Late Pleistocene. 
Ceramic pots dating to the Late Pleistocene 
have been found in several sites throughout East 
Asia. This Incipient Jomon pot from Kubodera- 
minami in Niigata prefecture in Japan is around 
15,000 years old. Craig and colleagues’ analysis” 
of such pottery has revealed lipid deposits that 
suggest the pots were used to cook fish. 


But it was the application” of a new type of 
radiocarbon dating using accelerator mass 
spectrometry — made famous by its use in the 
reassessment of the date of the Turin Shroud — 
to carbonized accretions on the surface of pot- 
sherds from a small undecorated vessel from 
northern Japan, at Odai Yamamoto, that set the 
debate over the existence of Late Pleistocene 
pottery alight. 

When the early dates for pottery use at Fukui 
Cave and other sites were first announced, 
many Japanese archaeologists were reluctant 
to accept them. At the time, chronologies of 
Japanese prehistory were based largely on the 
recognition of different styles of pottery. This 
framework had its origins in pre-war research 
that avoided explicit chronologies so as not to 
clash with the version of Japanese history laid 
down in two eighth-century texts, the Kojiki 
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and Nihon Shoki. Under the militaristic 
regime of the 1930s and ’40s, questioning 
these sacred sources was unacceptable. 
Even after 1945, as in Europe, authority 
figures found it impossible to reconcile 
new radiocarbon-based dating of archae- 
ological finds with their own style-based 
chronologies®”. 
Furthermore, until the 1990s, the ances- 
try of most of the current population of the 
Japanese archipelago was thought to trace 
back to the rice-farming peoples of the Yayoi 
period, then dated from about 300 Bc to about 
AD 300. The arrival of paddy-rice agriculture 
in the first millennium Bc was regarded as 
marking a major break in population and cul- 
ture from the preceding Jomon period, named 
after the eponymous cord-marked pottery 
from the time (Fig. 1). The Jomon people were 
largely considered to be primitive aboriginals 
who had little to do with the development of 
later Japanese culture’. 
But the plethora of new archaeological dis- 
coveries associated with the post-war devel- 
opment boom changed the image of Jomon 
society. The discovery and excavation in the 
early 1990s of one of the largest known Jomon 
settlements, at Sannai Maruyama, just a few 
kilometres from Odai Yamamoto, produced the 
remains of buildings and burials, and evidence 
for spatial planning and a rich material culture, 
including tonnes of potsherds. Radiocarbon 
dating demonstrated that the site was continu- 
ously occupied, at varying levels of intensity, 
for nearly two millennia’. These findings were 
announced as Japan was coming to terms with 
the end ofa seemingly endless trajectory of eco- 
nomic growth, and the Jomon culture captured 
the public imagination as offering an alternative 
vision of sustainable community life in the Japa- 
nese archipelago. The Jomon became embraced 
as ancestral and in tune with the natural envi- 
ronment of Japan”. 

Soon after, however, Japanese archaeol- 
ogy went into crisis. In 2000, claims made in 
the 1980s and ’90s of increasingly early dates 
for ‘Early Palaeolithic sites across eastern 
Honshu were shown to be the result of fraud- 
ulent activity (for a summary, see ref. 11). 
The revelations sent shock waves through 
the field and undermined public trust in the 
archaeologists who had worked so hard to 
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create a new past for Japan since 1945, when 
the version of history provided by the Kojiki 
and Nihon Shoki was no longer accepted as 
defining early Japanese history”. 

Specialists in the techniques that are used to 
identify fatty acids on archaeological finds were 
implicated in this fraud, bringing the reliability 
of these methods into question. The unfortu- 
nate situation led to an enhanced recognition 
that archaeologists need the skills to critically 
assess scientific information, and that this must 
be open to international scrutiny. The analyses 
presented by Craig et al. not only restore faith 
in the use of lipid-analysis techniques for mak- 
ing inferences about archaeological objects, 
but also are representative of the tremendous 
resource offered by Japanese archaeology for 
understanding the human past. 

Our knowledge that ceramic containers 
were being made and used by hunter-gatherers 
in the Late Pleistocene in various parts of East 
Asia — from Japan to far eastern Russia and 
north and south China — means that pottery 
usage among hunter-gatherers is no longer 
seen as anomalous in the Old World. In fact, 
there may be evidence for routes of the intro- 
duction of pottery into Europe that are not 
associated with the introduction of farming”. 

Yet many questions remain about how 
and why pottery was invented. For example, 
the earliest known ceramics are not contain- 
ers at all, but fragments of figurines, such as 
those dating to 29,000 years ago from the 
Dolni Véstonice site in the Czech Republic”. 
Although the earliest known containers were 
probably used for cooking, the special signifi- 
cance given to this new technology from the 
outset should not be overlooked. The fact that 
the prehistoric foragers who were using these 
implements were not rigorous in their washing 
up also allows us, to some extent, to analyse the 
contents of their meals, as Craig and colleagues 
have begun to do. We now need to extend such 
evaluation to the cultural meanings of what 
they were eating in order to further understand 
why they needed pottery containers”. m 


Simon Kaner is at the Centre for Archaeology 
and Heritage, Sainsbury Institute for the Study 
of Japanese Arts and Cultures, and at the 
Centre for Japanese Studies, University of 

East Anglia, Norwich NR4 7T], UK. 

e-mail: s.kaner@uea.ac.uk 
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A cosmic growth spurt 
in an infant galaxy 


One of the most extreme starburst galaxies in the early Universe has been 
identified and characterized. This system shows the rapid formation of a massive 
galaxy when the Universe was only 6% ofits current age. SEE LETTER P.329 


DESIKA NARAYANAN & CHRIS CARILLI 


tarburst galaxies play a central part in 

our understanding of galaxy formation. 

These galaxies — which are defined by 
star-formation rates that exceed those of simi- 
larly massive normal galaxies by factors of 10 
to 100 — are probably the cosmic ancestors of 
present-day massive elliptical galaxies, and the 
sites at which supermassive black holes grow 
most actively. Charting the evolution of these 
galaxies through space and time is a funda- 
mental challenge for astronomers, and a task 
that is far from trivial. Simply finding these 
sources at early cosmic times requires very 
sensitive multiwavelength observations, clever 
use of data-selection techniques and a bit of 
luck. On page 329 of this issue, Riechers and 
colleagues’ develop a technique to find massive 
starburst galaxies in the early Universe. They 
report the detection of the farthest known 
pure starburst galaxy, which existed a mere 
880 million years after the Big Bang. 

This discovery builds on a long history of 
observational efforts to find and characterize 
the physical properties of starbursts at high 
redshift (a measure of location in cosmic time). 
Since the original discovery of distant, dust- 
obscured starburst galaxies at high redshift in 
1998, through the first sensitive infrared and 
submillimetre surveys of the sky”’, there has 
been an explosion of interest in the field. This 
has been motivated in part by the development 
of sensitive ground-based and space infrared 
telescopes, such as NASA's Spitzer Space Tele- 
scope, and the far-infrared and submillimetre 
capabilities of the European Space Agency’s 
Herschel Space Observatory. Starburst galax- 
ies are typically hidden at optical wavelengths, 
owing to copious amounts of obscuring dust; 
astronomers, therefore, make use of submil- 
limetre and infrared telescopes that capture 
the wavelength regime in which most energy 
from these galaxies is emitted. During the past 
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decade, researchers have learned that these 
high-redshift starbursts reside in exception- 
ally massive haloes of dark matter (with masses 
equivalent to 10 trillion Suns), and that they 
correspond to the most luminous star-forming 
galaxies in the Universe. 

Despite the marked advances in infrared 
and submillimetre-wave technology during 
the 2000s, measuring the exact distribution of 
redshifts of these galaxies has proved notori- 
ously difficult. Older samples of these infra- 
red- and submillimetre-wavelength-selected 
galaxies required heroic observational efforts 
to determine their redshifts: galaxies had to be 
identified in deep submillimetre surveys, and 
their precise position established with radio 
interferometers, before they could be observed 
with large optical telescopes to obtain a spec- 
tral line that could be used in redshift identifi- 
cation’. This process showed that high-redshift 
starbursts predominantly reside at a redshift 
of about 2 (roughly 3 billion years after the Big 
Bang). However, a nagging worry persists that 
radio selection has resulted in a low-redshift 
bias; that is, is there a substantial population of 
massive starbursts at earlier cosmic times that 
has been missed in recent studies? 

Riechers and colleagues found starbursts 
at record-setting redshifts by using a clever 
technique that capitalizes on the fact that 
these systems are expected to be extremely 
dusty. As a part of the Herschel Multi-Tiered 
Extragalactic Survey (HerMES), the larg- 
est project assigned to the Herschel Space 
Observatory, the authors searched 21 square 
degrees of the sky, and looked for sources that 
were extremely red in the telescope’s 250-, 
350- and 500-micrometre-wavelength filters. 
The team (correctly) guessed that these were 
very early galaxies whose already red colours 
were accentuated by the expansion of the 
Universe. Moreover, the authors capitalized 
on new broadband (radio through submilli- 
metre) spectrographs to obtain a redshift from 
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——— Outflow 


Figure 1 | A forming starburst. Mergers between two gas-rich galaxies 

can drive some of the most intense starburst galaxies in the Universe. This 
time sequence of images shows a numerical simulation” of the expected 
view by the Atacama Large Millimeter/submillimeter Array in Chile of the 
star-forming molecular gas of a high-redshift galaxy merger that has physical 


atomic and molecular lines, revealing a source 
(monikered HFLS 3) at redshift 6.34 — the 
highest-redshift infrared-selected starburst 
galaxy found so far’. 

A major added value of this work, how- 
ever, is that the authors go well beyond sim- 
ple ‘stamp-collecting’ goals of finding the 
earliest starburst galaxies, and obtain a huge 
suite of imaging and spectroscopic data from 
12 international telescope facilities to charac- 
terize the physics that drives the evolution of 
the galaxy. With broadband imaging data that 
ranged in wavelength from the optical through 
to centimetre-wave photons, Riechers et al. 
derived an incredible star-formation rate for 
this system — about 3,000-5,000 solar masses 
per year. This contrasts sharply with the pal- 
try roughly 2 solar masses per year of our own 
Milky Way. HFLS 3 may be the galaxy with the 
highest star-formation rate found so far. 

Following on from this, the team analysed 
numerous emission lines from carbon mon- 
oxide (a standard tracer of star-forming gas in 
galaxies), an emission line from singly ionized 
carbon (one of the dominant coolants in gal- 
axies) and emission lines from water. From an 
analysis of the excitation of carbon monoxide, 
as well as atomic-to-molecular line ratios, they 
determined that the gas participating in the 
compact starburst is probably illuminated by an 
intense radiation field, and may be thought of 
as a galaxy-wide star-formation region, compa- 
rable to the densest, most actively star-forming 
regions at the centres of giant molecular clouds 
in the Milky Way. In other words, this work has 
identified one of the most extreme star-forming 
environments in the Universe, at a time when 
the Universe was only 6% ofits current age. The 
interstellar medium in HFLS 3 was already sub- 
stantially enriched in the standard molecules 
that populate the interstellar medium in nearby 
spiral galaxies, such as our own. The authors 
further suggest that a merger between two 
gas-rich galaxies in the early Universe may be 
driving the starburst event, as well as a powerful 
molecular outflow (Fig. 1). 

Perhaps the most exciting aspect of this 
and similar® studies is the sheer potential in 
this field. Wide-band spectrometers on the 
Caltech Submillimeter Observatory in Hawaii, 
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the Plateau de Bure Interferometer in France, 
the Green Bank Telescope in West Virginia, 
the Karl G. Jansky Very Large Array in New 
Mexico and the Atacama Large Millimeter/ 
submillimeter Array in Chile now allow for 
rapid calculation of redshifts through the 
detection of multiple molecular emission lines, 
while avoiding the redshift biases that plagued 
previous studies’. Thus, capabilities will only 
continue to improve in the coming years with 
the Large Millimeter Telescope in Mexico 
and Cornell Caltech Atacama Telescope in 
Chile. Surveys should soon reveal the cosmic 
number-density evolution of high-redshift, 
optically obscured starbursts — a quantity 
that will place strict constraints on models 
of galaxy formation and evolution’. Further- 
more, the Atacama Large Millimeter Array 
holds incredible promise for providing exqui- 
site detail of the physical conditions in these 
extreme systems’. a 


SENSORY BIOLOGY 


conditions comparable to those observed in the starburst galaxy detected by 
Riechers and colleagues’. The simulated system drives a powerful molecular 
outflow from the nucleus after coalescence of the galaxies. The colours denote 
carbon monoxide emission intensity. Yellow corresponds to an intensity 
roughly 1,500 greater than black. Each panel is 50 kiloparsecs on a side. 


Desika Narayanan is in the Department of 
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Radio Astronomy Observatory, Pete V. 
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Mexico 87801, USA. 
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A whiff of genome 


The relatedness of animals influences their behaviour, but how do they detect 
their kin? It seems that nasal sensory neurons differentiate between minute 
structural differences in urinary peptides that reflect genetic variation. 


THOMAS BOEHM 


sk ten people how they would go about 
finding the right mate, and you will 
probably get ten different answers. But 
might they all be mistaken? Writing in Nature 
Communications, Sturm et al.’ describe how 
the olfactory system is an amazingly sensitive 
and accurate discriminator of genetic related- 
ness, and suggest that this faculty may pro- 
vide essential information that influences kin 
recognition, mate choice and other types of 
social interaction. 
Many aspects of animal behaviour require 
information about the extent of genetic 
relatedness to other individuals. During the 
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selection of the best possible mate, for example, 
some means of detecting family connections 
is essential to avoid inbreeding. Many sensory 
inputs contribute to solving this behavioural 
jigsaw puzzle, but the role of smell has received 
special attention. Volatile chemical com- 
pounds that attract potential mates over long 
distances are textbook examples of gender- 
and species-specific information carriers, but 
these pheromones convey no information 
on individuality. One possible source of such 
information is the thousands of DNA sequence 
differences present in animal genomes that 
allow even closely related individuals of a spe- 
cies to be easily distinguished”. It seems that 
animals are capable of converting this genetic 


code into olfactory information™, but exactly 
how this is achieved is only just starting to 
become clear. 

The genetic differences between individu- 
als often manifest themselves in the altered 
arrangement of amino-acid residues in pro- 
teins. Could such alternative protein forms 
be converted into individuality signatures? At 
face value, proteins constitute unlikely olfac- 
tory stimuli because of their large size. But 
the proteins and small peptides (generated from 
protein breakdown) that are present in bodily 
fluids can become aerosolized and thereby gain 
direct access to the olfactory system. 

In mice, the olfactory detection of two main 
types of genetically variable, or polymorphic, 
protein have been studied: the major urinary 
proteins (MUPs)’, which are excreted in 
copious amounts in the urine, and the pro- 
teins of the major histocompatibility complex 
(MHC)*, which are indispensable for effective 
immune responses. Indeed, controversy has 
arisen about which of these two classes of 
protein is responsible for individuality signals 
in mice*®. 

Structural differences in MUPs are thought 
to be sensed directly by the olfactory system’, 
whereas it has been proposed’ that the struc- 
tural diversity of MHC molecules is evaluated 
indirectly on the basis of the ligands that bind 
to them (Fig. 1). These ligands are short pep- 
tides that are breakdown products of intra- 
and extracellular proteins, and their structure 
reflects the binding properties of the MHC 
molecules, in a mode akin to a lock and key*. 
Hence, the ‘MHC peptide hypothesis’ suggests” 
that olfactory sensing of the key (the peptide 
ligand) provides information about the lock 
(the MHC protein). 

Peptides have been shown to activate olfac- 
tory sensory neurons at exceedingly low 
concentrations’, thereby satisfying one predic- 
tion of this hypothesis, but their presence in 
urine had not previously been demonstrated. To 
fill this void, Sturm et al. used mass spectrom- 
etry to search for urinary peptides that have 
characteristic sequences predicted to match the 
MHC molecules of mice with different geno- 
types. Although the authors found many such 
peptides, they made the surprising observation 
that the presence of these peptides was inde- 
pendent of MHC genotype. However, they also 
found that, in addition to MUPs, many other 
distinguishing genetic polymorphisms were 
reflected in the sequences of urinary peptides, 
suggesting that this complex ‘peptidome’ mir- 
rors at least some part of genetic individuality. 

But what about MHC peptide ligands that 
are predicted’ to also occur in the urine? Sturm 
and colleagues suggest that mass spectrometry 
is an unsuitable method to reproducibly detect 
low concentrations of specific MHC pep- 
tide ligands among the plethora of other uri- 
nary peptides present at million-fold higher 
concentrations. To get around this problem, 
the authors used ‘empty’ MHC molecules 
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Figure 1 | Alternative sources of ligands for olfactory sensory neurons. a, Sturm et al.' show that 
degradation products of proteins are excreted into the urine and that these peptides serve as ligands 

for olfactory sensory neurons. Peptide variants that differ by a single amino-acid residue can activate 
different neurons, allowing the detection of genetic differences that lead to amino-acid alterations. b, The 
authors also show that peptides bound to MHC molecules activate olfactory neurons following liberation 
from the MHC and excretion. Because the structure of MHC molecules differs between individuals, the 
neuronal activation caused by these structurally diverse peptides will also reflect individuality. 


to capture ‘free’ peptide ligands from mouse 
urine. By this means, they obtained unequivo- 
cal evidence that MHC ligands are indeed 
present in urine. They then studied the 
response of individual nasal sensory neurons 
to MHC-dependent and MHC-independent 
peptides, revealing that not only do individual 
neurons respond to these molecules at biologi- 
cally relevant concentrations (ranging from 
10°’ to 10 °moles per litre), but they also faith- 
fully discriminate between the two classes of 
peptide. Furthermore, the researchers observed 
distinct activation patterns in sensory neurons 
elicited by cocktails of peptides from mice 
of different genotypes, suggesting that these 
urinary peptides should thus be considered to 
constitute an ‘individuality signature’ 

These findings suggest that the two main 
competing hypotheses about how the olfactory 
system detects genetic individuality (through 
either MUPs or MHC ligands) may not be as 
mutually exclusive as was previously assumed. 
Instead, it seems that the sequence-dependent 
activation pattern of olfactory sensory neurons 
reflects the diverse origins of peptides con- 
tained in the urinary peptidome — ligands of 
polymorphic MHC molecules and breakdown 
products of other polymorphic proteins, per- 
haps including MUPs. The structural basis 
for the recognition of this diverse repertoire 
of peptide ligands remains unclear; one pos- 
sibility is that, rather than expressing just one 
receptor, sensory neurons use hetero-oligo- 
meric receptors for peptide detection. 

From an evolutionary perspective, these 
ideas indicate a universal mechanism of 
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olfactory assessment of genetic individuality, 
even for animals that do not possess an MHC. 
But this work also has implications beyond 
animal behaviour. Olfactory recognition of 
MHC-bound peptides suggests that the exqui- 
site sensitivity and precision of sensory evalu- 
ation might be used to detect the presence of 
infectious agents, because peptides derived 
from these agents are ‘presented’ to cells of 
the immune system by MHC molecules and 
might eventually appear in the urine. And 
from a more practical point of view, sensory 
neurons may now present themselves as a 
biological alternative to mass spectrometers, 
for use when the analysis of peptide mixtures 
depends on accuracy over a large range of 
concentrations. 
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Molecular motion 


watched 


A laser pulse can switch certain crystals from an insulating phase to a highly 
conducting phase. The ultrafast molecular motions that drive the transition have 
been directly observed using electron diffraction. SEE LETTER P.343 


BRADLEY SIWICK & ERIC COLLET 


ichard Feynman once said, “everything 
R= living things do can be understood 
in terms of the jiggling and wiggling 
of atoms”. It is not only biological processes, 
however, that are driven by the microscopic 
structural rearrangements of atoms and mol- 
ecules — this idea cuts through many scientific 
disciplines. The underlying motions are usu- 
ally hidden from view, because even the most 
advanced conventional methods have been 
unable to directly observe the dynamic evo- 
lution of atomic (or molecular) structure on 
its shortest timescales. But on page 343 of this 
issue, Gao et al.' report their use of ultrafast 
electron diffraction to provide a detailed look 
at one such hidden world. The authors have 
effectively ‘watched’ molecules rearrange in a 
crystal at the picosecond timescale on which 
chemical bonds are made and broken (1 pico- 
second (ps) is 10°” seconds). 
Specifically, Gao et al. have investigated 
a crystalline organic salt consisting of 


electron-donor molecules (ethylenedioxy- 
tetrathiafulvalene, EDO-TTF) and hexafluoro- 
phosphate counterions (PF, ). At room tem- 
perature, the donor (D) molecules in the 
crystal are organized into linear chains that 
exhibit a high degree of ‘inversion’ symme- 
try. This high symmetry means that electrons 
are evenly shared by D molecules along the 
chain and that the material has high (metallic) 
conductivity (Fig. 1). 

At temperatures below 280 kelvin, however, 
the electrons are trapped on some pairs of D 
molecules but not on others. The molecules 
without electrons become positively charged, 
giving rise to a periodic D*'D*™'D°D° molecular 
sequence along the chain (the numbers in the 
sequence indicate the charge on each donor). 
This lowers the symmetry of the system and 
prevents electrons from jumping easily from 
pair to pair, so that the crystal becomes an 
insulator. The salt also undergoes an ultrafast 
light-induced transition’ from the insulat- 
ing phase to the metallic phase. Gao and col- 
leagues set out to uncover the nature of the 
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Figure 1 | Molecular motion of a phase switch. In the insulating state of the crystalline organic salt 
studied by Gao et al.', electron-donor molecules are either positively charged and straight, or neutral 

and bent. This reduces the symmetry of the system and traps positive charge (dashed red boxes) on pairs 
of adjacent molecules, preventing conduction — much like an open switch in a circuit (lower panel). In 
the salt’s metallic state, the symmetry of the system is higher because the molecules all carry equivalent 
charge, which delocalizes throughout the crystal (red shading), and allows conduction (closed switch and 
red wires in the circuit below). Gao et al. report that, 2 picoseconds after the insulating phase of the salt 

is excited by a laser pulse, a transient intermediate structure forms as a result of molecular motions (blue 
arrows). This intermediate contains localized structures in which electrons reside on alternate molecules. 
The intermediate state converts into the metallic state after 100 ps. 
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light-induced molecular rearrangements that 
act as a switch for this transition. 

Transitions like this one represent a new 
way in which matter can be manipulated by 
light*. In some materials, the subtle coupling 
between electronic and structural degrees of 
freedom following exposure to light seems 
to be strong enough to drive the system into 
an excited phase that has different electronic 
and structural order than the ground state, 
and which cannot be accessed through more 
conventional routes (for example, by using 
temperature, pressure, or electric or mag- 
netic fields as control parameters). In order to 
control such ultrafast transitions for techno- 
logical applications, the pathway along which 
structural and electronic changes occur must 
be observed in real time. How can this be 
accomplished? 

Gao and co-workers answer for the (EDO- 
TTF),PF, salt was to follow molecular motions 
through their effect on the electron-diffraction 
pattern of the crystal by combining tools and 
techniques from electron microscopy with 
those of ultrafast laser spectroscopy. The 
remarkable recent improvements in the spa- 
tial resolution of electron microscopes (now 
below 0.5 angstréms) have been parallelled by 
enhancements in the temporal resolution of 
both electron imaging and diffraction’. Gao 
and colleagues’ work raises the bar of achieve- 
ment for electron-diffraction instrumentation, 
and was enabled by a series of breakthroughs 
that resulted from first understanding’ and 
then manipulating the evolution of ultrashort 
electron pulses using particle-accelerator 
technology’. 

The power of this technology is evident 
from the authors’ results. By collecting time- 
resolved electron-crystallography data of 
unprecedented quality from photoexcited 
(EDO-TTF),PF,, Gao et al. mapped the pri- 
mary molecular motions involved in the 
earliest stages of the salt’s structural phase 
transition. These motions occur on timescales 
from a few hundred femtoseconds (fs; 1 femto- 
second is 10°'* seconds) to several picoseconds, 
and were therefore previously uncharacterized. 
The authors compared the molecular arrange- 
ment of the salt’s unit cell at these early stages 
to the crystal structure of the insulating and 
metallic phases at thermal equilibrium. In this 
way, they determined that a transient interme- 
diate structure unlike either equilibrium phase 
forms during the first few picoseconds after 
excitation. 

These results add insight to what was known 
from previous spectroscopic studies”’. The 
researchers observed that, in the intermediate 
structure, molecules rearrange to form local- 
ized D*'D°D"'D® sequences (Fig. 1). A second 
transformation then occurs to generate the 
long-range, high-symmetry stacking of the 
metallic structure. This second transformation 
was well characterized by the time course of 
symmetry-sensitive electron-diffraction peaks 


in Gao and colleagues’ experiments, which 
indicates that the equilibrium metallic phase 
is reached within about 100 ps. Because con- 
ductivity correlates with crystal symmetry, the 
low-symmetry intermediate structure is more 
like an insulator than a metal. 

Recent years have also seen tremendous 
success in the use of time-resolved X-ray dif- 
fraction to obtain atomic-level views of tran- 
sient phenomena in inorganic, molecular and 
even biomolecular systems’®, notably with 
the emergence of X-ray free-electron lasers". 
Gao and colleagues’ work demonstrates that 
laboratory-scale, ultrafast electron-diffraction 
set-ups represent an attractive alternative to 
synchrotrons or X-ray free-electron lasers — 
massive installations that require electron 
accelerators kilometres in length. At pre- 
sent, however, the temporal resolution of the 
authors’ apparatus is limited to about 400 fs, 
and so it could not catch the faster, coherent 
molecular motion (previously evidenced by 
optical spectroscopy’) that drives the sys- 
tem to the transient intermediate state itself. 
Nevertheless, their study represents an 
experimental tour de force. 

Femtosecond electron diffraction opens up 
new possibilities for investigating weakly dif- 
fracting systems composed of light elements, 
such as organic materials and two-dimensional 
protein crystals, or for studying systems for 
which laser penetration is too small to perform 
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X-ray studies. More broadly, Gao and col- 
leagues’ work adds to the growing range of 
advanced structural studies, be they performed 
with electrons or X-rays, that are pushing the 
boundaries of experiments towards the limits 
of spatial and temporal resolution, opening 
the way towards greater understanding and 
control of molecular dynamics and material 
properties. m 
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Nitrogen deposition and 
forest carbon 


Human activity, such as agricultural fertilizer use, has increased the amount 
of nitrogen deposited onto forests from the atmosphere. The photosynthetic 
response to this in evergreen needleleaf forests has been quantified globally. 


BEVERLY LAW 


itrogen-containing compounds 
deposited from the atmosphere can 
affect the amount of carbon that is 
absorbed into ecosystems by photosynthesis. 
Writing in Global Biogeochemical Cycles, 
Fleischer et al.' reveal that photosynthesis by 
boreal and temperate evergreen forests rises 
with increased atmospheric nitrogen deposi- 
tion, but levels out when a threshold value of 
8 kilograms of nitrogen per hectare per year 
is reached. These findings highlight the need 
to clarify the connections between carbon and 
nitrogen in the environment, and to disentan- 
gle the effects of climate from those of nitrogen 
deposition on forests. 
The terrestrial biosphere is thought to take 


up about 30% of human-produced carbon 
dioxide from the atmosphere annually, lessen- 
ing the greenhouse effect of fossil-fuel emis- 
sions’. However, estimates of the size of this 
terrestrial sink are uncertain because of major 
gaps in our knowledge of the magnitude of the 
effects of the factors involved. One of these fac- 
tors is the variation in human-caused additions 
of nitrogen to the atmosphere over time and 
space. Climate change and atmospheric ferti- 
lization of plants by human sources of nitrogen 
and CO, probably affect plant growth rates all 
over the world, but our understanding of these 
effects is likely to remain incomplete for the 
foreseeable future’. 

Most plants cannot use nitrogen gas in 
the atmosphere for growth. They require 
it to be converted to usable forms, such as 
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ammonia — a process that forms part of the 
nitrogen cycle. However, the natural nitrogen 
cycle has been heavily influenced by human 
activities, which produce highly unstable 
forms of nitrogen known collectively as reac- 
tive nitrogen. The deposition of reactive nitro- 
gen from the atmosphere to forests occurs 
mainly as a result of agricultural fertilizer use 
and fossil-fuel combustion, and has increased 
from 15 million tonnes of human-produced 
reactive nitrogen per year in the 1860s to 
187 million tonnes by 2005 (ref. 4). Nitrogen 
deposition is expected to continue to increase 
in many regions, and has been predicted’ to 
almost double globally by 2050. 

The addition of reactive nitrogen to the 
atmosphere by humans affects climate, and 
the composition and function of terrestrial 
and aquatic ecosystems’. High levels of nitro- 
gen deposition have many damaging effects on 
ecosystems, but small additions can be benefi- 
cial for otherwise nitrogen-limited ecosystems, 
because they increase the uptake of CO, from 
the atmosphere by photosynthesis. The effect 
of nitrogen deposition on carbon sequestration 
by soil is less clear. Most studies suggest that 
the net effect is between 35 and 65 kilograms of 
carbon sequestered per kilogram of nitrogen”. 
Much of the variation can be attributed to dif- 
ferent patterns of growth and to the availability 
of other resources for growth. 

Although low levels of nitrogen deposi- 
tion might mitigate the effects of increased 
atmospheric CO, to some degree, 53-76% of 
this coincidental benefit is itself estimated to 
be offset globally’. This is because nitrogen 
deposition can stimulate net emissions of 
other greenhouse gases (methane and nitrous 
oxide) that are products of microbial activity 
in the soils of many ecosystems. Thus, the rela- 
tive contribution of nitrogen deposition to the 
strength of the terrestrial carbon sink remains 
uncertain. 

To address this problem, near-continuous 
atmospheric observations have been made 
from towers above vegetation canopies to pro- 
vide estimates of CO, uptake (photosynthesis) 
and release (respiration) from and to the 
atmosphere, respectively. Many of these sites 
have been running for more than a decade, 
and the data are summarized by the FLUXNET 
Project®. In combination with biological and 
environmental data, such information has 
been used by researchers to examine the effect 
of factors such as climate variability on car- 
bon processes at the whole-ecosystem scale at 
many sites around the world. 

Fleischer et al. used FLUXNET data from 
80 sites that had sufficient information about 
nitrogen and carbon fluxes for their analysis. 
They found that, for evergreen needleleaf for- 
ests in temperate and boreal zones, maximal 
photosynthesis under optimal environmen- 
tal conditions increased with continuous 
nitrogen-deposition rates up to a threshold of 
about 8 kg of nitrogen deposition per hectare 
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per year. Above this value, no further increase 
in photosynthesis was observed. 

Forests above the threshold are therefore 
at an intermediate stage of nitrogen satura- 
tion — a stage at which nitrogen availability 
exceeds microbial and plant demands, and 
can result in some nitrogen leaching from the 
ecosystem. Prolonged availability of excessive 
nitrogen can lead to more leaching, decreased 
growth and nutrient imbalances. By contrast, 
the evergreen needleleaf forests that responded 
most strongly to nitrogen deposition are in a 
nitrogen-limited range, within which photo- 
synthetic capacity increases with deposition. 
The observed threshold is a small fraction of 
the nitrogen that farmers would use annually 
in fertilizers. 

The authors also found that boreal evergreen 
needleleaf forests had a slightly lower photo- 
synthetic response overall to nitrogen deposi- 
tion. For both boreal and temperate evergreen 
forests, this translates to roughly 25 kg of 
carbon sequestered per kilogram of nitro- 
gen, less than the estimated global average*”” 
of 35-65. 

The results are confounded by the effects 
of climate on photosynthesis: the nitrogen- 
deposition effect may be larger or smaller than 
Fleischer and colleagues’ findings because part 
of the observed response is probably a result 
of climate. However, there is no evidence to 
suggest that the contribution of nitrogen depo- 
sition is zero. Although the authors attempted 
to determine the thresholds for other forest 
types, they were limited by the availability of 
biological data on both carbon and nitrogen 
processes. More comprehensive measurements 
of nitrogen stocks and cycling at the global net- 
work of carbon monitoring sites are required 


COMPLEX SYSTEMS 


to separate the effects of climate and nitrogen 
deposition on forests. 

The net emissions of all greenhouse gases — 
including CO,, methane and nitrous oxide 
— should be considered when examining 
the net effects of nitrogen deposition and cli- 
mate on ecosystems. A better understanding 
of how the connections between carbon and 
nitrogen in the environment could change in 
the future is also required. Nevertheless, Fleis- 
cher and co-workers’ study lays the ground- 
work needed to refine estimates of the effects 
of climate and nitrogen deposition on the 
terrestrial biosphere’s ability to remove car- 
bon from the atmosphere as CO, emissions 
increase. Such refinements will be necessary 
to improve predictions of the effects of these 
emissions on ecosystems at local, regional and 
continental scales. m 
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Spatial signatures 


of resilience 


Predicting when the dynamics of a complex system will undergo a sudden 
transition is difficult. New experiments show that the spatial distribution of 
organisms can indicate when such tipping points are near. SEE LETTER P.355 


STEPHEN R. CARPENTER 


he divergence of dynamics towards 
sharply different states occurs in com- 
plex systems in fields ranging from 
physics and physiology to ecology and social 
sciences. The thresholds for these critical 
transitions are often unknown until surpris- 
ing shifts occur. Establishing measures of the 
distance to a threshold — the resilience of a 
system — could allow researchers to compare 
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the stability of different systems or even antici- 
pate an impending transition. In a paper on 
page 355 of this issue, Dai et al.' present a novel 
resilience index and use ingenious laboratory 
experiments to support the theory underly- 
ing it*. Their approach is based on the spatial 
distribution of organisms and thereby adds, 
quite literally, a new dimension to attempts to 
predict transitions. 


*This article and the paper under discussion were 
published online on 10 April 2013. 
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Figure 1 | Paths to recovery. Dai and colleagues' studied a series of linked cultures of yeast populations 
subject to catastrophic collapse when the dilution factor is raised above a critical threshold. They then 
introduced a ‘bad patch (position 0) in the form ofa culture with a high dilution factor. Recovery length 
is the distance that a population must be from a bad patch for it to recover 90% of its original high 
concentration (a population size of 1). When the entire series of cultures was highly resilient (far from 
the threshold) the recovery length was short (top panel), whereas when the series of cultures had low 
resilience (close to the threshold) the recovery length was long (bottom panel). 


Examples of the spatial distribution of 
organisms include the pattern of trees in a 
forest, of aquatic insects on the bottom of a 
stream, or of mussels along a rocky shoreline. 
In such ecosystems, a patch of unusually good 
habitat will be surrounded by a halo of patches 
with high population density, owing to the dif- 
fusion of organisms from the good patch to 
neighbouring sites. Likewise, a patch of poor 
habitat creates a halo of low population density. 
As the distance away from the patch of unusual 
habitat increases, the population density will 
gradually return to the average for the land- 
scape as a whole. The distance required for 
the population density to return to the average 
value is referred to as the recovery length. Dai 
and colleagues show that the recovery length 
provides an indication of the resilience of the 
landscape pattern (Fig. 1). 

The proposal to use recovery length as a 
resilience index is in contrast to most research 
on resilience indicators, which has used time 
series without spatial information’, such as 
populations of fish, density of grasses, or algae 
concentrations in a lake. However, high-resolu- 
tion time series are rare in ecology and may not 
be available for crucial ecosystems when man- 
agers need to make decisions. Spatial snapshots 
of resilience offer unique advantages’: they do 
not require intensive time series, and they may 
often be reconstructed from archives of aerial 
photos or satellite images. 

To demonstrate the usefulness of recovery 
length, Dai et al. designed a system of yeast 
cultures that were physically linked and that 
exhibited alternate stable states of high or low 


cell concentration, to represent patches in an 
ecosystem. In their experiments, the propor- 
tion of the liquid culture medium replaced 
each day (the dilution factor) had a critical 
threshold. At a low dilution factor, the yeast 
concentration of a patch was at the high steady 
state. As the dilution factor was gradually 
increased, the yeast concentration of the patch 
fell smoothly until the threshold dilution rate 
was reached, and then fell sharply to the low 
steady state. Thus, the gradual increase in dilu- 
tion rate from a low value erodes the resilience 
of the system until the threshold is reached and 
the yeast population collapses. 

The experimenters introduced a ‘bad patch 
(with a high dilution factor and therefore 
a low yeast concentration) into a series of 
linked cultures with identical dilution fac- 
tors and yeast concentrations near the high 
steady state. As distance away from the bad 
patch increased, the yeast concentration 
increased gradually towards the high steady 
state (Fig. 1). The distance required for the 
yeast population to recover to the high steady 
state is the recovery length. As the dilution fac- 
tor, and therefore the resilience, of the entire 
series of cultures declined, the recovery length 
increased. 

Previous studies have shown that popu- 
lations become more variable and recover 
more slowly from disturbance as resilience 
declines**. But Dai and colleagues demonstrate 
that these indicators of resilience are damp- 
ened by movements of organisms between 
patches. Furthermore, their results indicate 
that the influence ofa patch on its neighbours 
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becomes more widespread as resilience 
declines (Fig. 1). So, as resilience decreases, 
the effects of a bad patch spread farther from 
the epicentre. Similar effects are known from 
spatial models of populations subjected to 
harvesting. Increases in recovery length may 
also be related to the decreasing abundance 
of large patches, as occurs in real landscapes 
when overgrazing or climate change trans- 
forms grasslands into deserts’. 

This study and others”* demonstrate how 
laboratory experiments can connect model 
studies of resilience to real-world field applica- 
tions. Simulation models are rapidly expand- 
ing our knowledge of the situations in which 
resilience indicators can or cannot be used, 
and of the statistical challenges of measur- 
ing resilience in specific situations. However, 
few field experiments use manipulated and 
reference ecosystems to evaluate resilience 
indicators’. We need intensive high-quality 
data collected in multiple ecosystems for 
many years before we will be able to measure 
resilience at the scales needed for ecosystem 
management. 

Dai and colleagues’ results suggest that 
assessments of recovery length may be a 
powerful approach for determining resil- 
ience in large-scale field studies. According 
to their findings, in resilient ecosystems, a 
patch of unusually good or bad quality cre- 
ated by experimenters should have limited 
spatial influence. But in an ecosystem close 
to collapse, a patch of unusual quality should 
have widespread effects. Instead of collecting 
detailed time series of population densities 
through laborious field work, researchers can 
assess resilience by measuring recovery length 
in maps derived from remotely sensed data at 
just a few time points. 

Thresholds for the collapse of harvested 
populations, the degradation of range- 
lands, toxic algal blooms in water supplies, 
species invasions and population extinc- 
tions will continue to challenge ecologists 
and natural-resource managers. We will 
never eliminate surprises. But recovery 
length may provide a tool that allows us to 
steer away from some thresholds before 
they are crossed. m 
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The African coelacanth genome provides 
insights into tetrapod evolution 
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The discovery of a living coelacanth specimen in 1938 was remarkable, as this lineage of lobe-finned fish was thought to 
have become extinct 70 million years ago. The modern coelacanth looks remarkably similar to many of its ancient 
relatives, and its evolutionary proximity to our own fish ancestors provides a glimpse of the fish that first walked on 
land. Here we report the genome sequence of the African coelacanth, Latimeria chalumnae. Through a phylogenomic 
analysis, we conclude that the lungfish, and not the coelacanth, is the closest living relative of tetrapods. Coelacanth 
protein-coding genes are significantly more slowly evolving than those of tetrapods, unlike other genomic features. 
Analyses of changes in genes and regulatory elements during the vertebrate adaptation to land highlight genes involved 
in immunity, nitrogen excretion and the development of fins, tail, ear, eye, brain and olfaction. Functional assays of 
enhancers involved in the fin-to-limb transition and in the emergence of extra-embryonic tissues show the importance 
of the coelacanth genome as a blueprint for understanding tetrapod evolution. 


In 1938 Marjorie Courtenay-Latimer, the curator of a small natural _ considered to be one of the most notable zoological finds of the twen- 
history museum in East London, South Africa, discovered a large, _ tieth century. Latimeria is the only living member of an ancient group 
unusual-looking fish among the many specimens delivered to her by _ of lobe-finned fishes that was known previously only from fossils and 
a local fish trawler. Latimeria chalumnae, named after its discoverer’, believed to have been extinct since the Late Cretaceous period, 
was over 1 m long, bluish in colour and had conspicuously fleshy fins approximately 70 million years ago (Myr ago). It was almost 15 years 
that resembled the limbs of terrestrial vertebrates. This discovery is _ before a second specimen of this elusive species was discovered in the 
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Comoros Islands in the Indian Ocean, and only 309 individuals have 
been recorded in the past 75 years (R. Nulens, personal communication)’. 
The discovery in 1997 of a second coelacanth species in Indonesia, 
Latimeria menadoensis, was equally surprising, as it had been assumed 
that living coelacanths were confined to small populations off the East 
African coast**. Fascination with these fish is partly due to their pre- 
historic appearance—remarkably, their morphology is similar to that 
of fossils that date back at least 300 Myr, leading to the supposition that, 
among vertebrates, this lineage is markedly slow to evolve’». Latimeria 
has also been of particular interest to evolutionary biologists, owing to 
its hotly debated relationship to our last fish ancestor, the fish that first 
crawled onto land’. In the past 15 years, targeted sequencing efforts 
have produced the sequences of the coelacanth mitochondrial genomes’, 
HOX clusters* and a few gene families”'®. Nevertheless, coelacanth 
research has felt the lack of large-scale sequencing data. Here we describe 
the sequencing and comparative analysis of the genome of L. chalumnae, 
the African coelacanth. 


Genome assembly and annotation 


The African coelacanth genome was sequenced and assembled using 
DNA from a Comoros Islands Latimeria chalumnae specimen (Sup- 
plementary Fig. 1). It was sequenced by Illumina sequencing tech- 
nology and assembled using the short read genome assembler 
ALLPATHS-LG". The L. chalumnae genome has been reported previ- 
ously to have a karyotype of 48 chromosomes’’. The draft assembly is 
2.86 gigabases (Gb) in size and is composed of 2.18 Gb of sequence plus 
gaps between contigs. The coelacanth genome assembly has a contig 
N50 size (the contig size above which 50% of the total length of the 
sequence assembly can be found) of 12.7 kilobases (kb) and a scaffold 
N50 size of 924 kb, and quality metrics comparable to other lumina 
genomes (Supplementary Note 1, and Supplementary Tables 1 and 2). 
The genome assembly was annotated separately by both the Ensembl 
gene annotation pipeline (Ensembl release 66, February 2012) and by 
MAKER”. The Ensembl gene annotation pipeline created gene models 
using protein alignments from the Universal Protein Resource (Uni- 
prot) database, limited coelacanth complementary DNA data, RNA-seq 
data generated from L. chalumnae muscle (18 Gb of paired-end reads 
were assembled using Trinity software’, Supplementary Fig. 2) as well 
as orthology with other vertebrates. This pipeline produced 19,033 
protein-coding genes containing 21,817 transcripts. The MAKER 
pipeline used the L. chalumnae Ensembl gene set, Uniprot protein 
alignments, and L. chalumnae (muscle) and L. menadoensis (liver 
and testis)’ RNA-seq data to create gene models, and this produced 
29,237 protein-coding gene annotations. In addition, 2,894 short non- 
coding RNAs, 1,214 long non-coding RNAs, and more than 24,000 
conserved RNA secondary structures were identified (Supplementary 
Note 2, Supplementary Tables 3 and 4, Supplementary Data 1-3 and 
Supplementary Fig. 3). It was inferred that 336 genes underwent spe- 
cific duplications in the coelacanth lineage (Supplementary Note 3, 
Supplementary Tables 5 and 6, and Supplementary Data 4). 


The closest living fish relative of tetrapods 

The question of which living fish is the closest relative to ‘the fish that 
first crawled on to land’ has long captured our imagination: among 
scientists the odds have been placed on either the lungfish or the 
coelacanth’®. Analyses of small to moderate amounts of sequence data 
for this important phylogenetic question (ranging from 1 to 43 genes) 
has tended to favour the lungfishes as the extant sister group to the 
land vertebrates'’. However, the alternative hypothesis that the lung- 
fish and the coelacanth are equally closely related to the tetrapods 
could not be rejected with previous data sets’*. 

To seek a comprehensive answer we generated RNA-seq data from 
three samples (brain, gonad and kidney, and gut and liver) from the 
West African lungfish, Protopterus annectens, and compared it to gene 
sets from 21 strategically chosen jawed vertebrate species. To performa 
reliable analysis we selected 251 genes in which a 1:1 orthology ratio 
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was clear and used CAT-GTR, a complex site-heterogeneous model of 
sequence evolution that is known to reduce tree-reconstruction arte- 
facts’? (see Supplementary Methods). The resulting phylogeny, based 
on 100,583 concatenated amino acid positions (Fig. 1, posterior prob- 
ability = 1.0 for the lungfish-tetrapod node) is maximally supported 
except for the relative positions of the armadillo and the elephant. It 
corroborates known vertebrate phylogenetic relationships and 
strongly supports the conclusion that tetrapods are more closely 
related to lungfish than to the coelacanth (Supplementary Note 4 
and Supplementary Fig. 4). 


The slowly evolving coelacanth 


The morphological resemblance of the modern coelacanth to its fossil 
ancestors has resulted in it being nicknamed ‘the living fossil’. This 
invites the question of whether the genome of the coelacanth is as 
slowly evolving as its outward appearance suggests. Earlier work 
showed that a few gene families, such as Hox and protocadherins, 
have comparatively slower protein-coding evolution in coelacanth 
than in other vertebrate lineages*’°. To address the question, we 
compared several features of the coelacanth genome to those of other 
vertebrate genomes. 

Protein-coding gene evolution was examined using the phyloge- 
nomics data set described above (251 concatenated proteins) (Fig. 1). 
Pair-wise distances between taxa were calculated from the branch 
lengths of the tree using the two-cluster test proposed previously” 
to test for equality of average substitution rates. Then, for each of 
the following species and species clusters (coelacanth, lungfish, 
chicken and mammals), we ascertained their respective mean distance 
to an outgroup consisting of three cartilaginous fishes (elephant 
shark, little skate and spotted catshark). Finally, we tested whether 
there was any significant difference in the distance to the outgroup of 
cartilaginous fish for every pair of species and species clusters, using a 
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Figure 1 | A phylogenetic tree of a broad selection of jawed vertebrates 
shows that lungfish, not coelacanth, is the closest relative of tetrapods. 
Multiple sequence alignments of 251 genes with a 1:1 ratio of orthologues in 
22 vertebrates and with a full sequence coverage for both lungfish and 
coelacanth were used to generate a concatenated matrix of 100,583 
unambiguously aligned amino acid positions. The Bayesian tree was inferred 
using PhyloBayes under the CAT + GTR + I°y model with confidence estimates 
derived from 100 gene jack-knife replicates (support is 100% for all clades but 
armadillo + elephant with 45%)*®. The tree was rooted on cartilaginous fish, and 
shows that the lungfish is more closely related to tetrapods than the coelacanth, 
and that the protein sequence of coelacanth is evolving slowly. Pink lines 
(tetrapods) are slightly offset from purple lines (lobe-finned fish), to indicate 
that these species are both tetrapods and lobe-finned fish. 
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Z statistic. When these distances to the outgroup of cartilaginous fish 
were compared, we found that the coelacanth proteins that were 
tested were significantly more slowly evolving (0.890 substitutions 
per site) than the lungfish (1.05 substitutions per site), chicken (1.09 
substitutions per site) and mammalian (1.21 substitutions per site) 
orthologues (P< 10° ° inall cases) (Supplementary Data 5). In addition, 
as can be seen in Fig. 1, the substitution rate in coelacanth is approxi- 
mately half that in tetrapods since the two lineages diverged. A Tajima’s 
relative rate test” confirmed the coelacanth’s significantly slower rate 
of protein evolution (P< 10 7°) (Supplementary Data 6). 

We next examined the abundance of transposable elements in the 
coelacanth genome. Theoretically, transposable elements may make 
their greatest contribution to the evolution of a species by generating 
templates for exaptation to form novel regulatory elements and exons, 
and by acting as substrates for genomic rearrangement’. We found 
that the coelacanth genome contains a wide variety of transposable- 
element superfamilies and has a relatively high transposable-element 
content (25%); this number is probably an underestimate as this is a 
draft assembly (Supplementary Note 5 and Supplementary Tables 
7-10). Analysis of RNA-seq data and of the divergence of individual 
transposable-element copies from consensus sequences show that 
14 coelacanth transposable-element superfamilies are currently active 
(Supplementary Note 6, Supplementary Table 10 and Supplementary 
Fig. 5). We conclude that the current coelacanth genome shows both 
an abundance and activity of transposable elements similar to many 
other genomes. This contrasts with the slow protein evolution observed. 

Analyses of chromosomal breakpoints in the coelacanth genome 
and tetrapod genomes reveal extensive conservation of synteny and 
indicate that large-scale rearrangements have occurred at a generally 
low rate in the coelacanth lineage. Analyses of these rearrangement 
classes detected several fission events published previously” that are 
known to have occurred in tetrapod lineages, and at least 31 inter- 
chromosomal rearrangements that occurred in the coelacanth lineage 
or the early tetrapod lineage (0.063 fusions per 1 Myr), compared to 
20 events (0.054 fusions per 1 Myr) in the salamander lineage and 
21 events (0.057 fusions per 1 Myr) in the Xenopus lineage” (Sup- 
plementary Note 7 and Supplementary Fig. 6). Overall, these analyses 
indicate that karyotypic evolution in the coelacanth lineage has 
occurred at a relatively slow rate, similar to that of non-mammalian 
tetrapods”. 

Ina separate analysis we also examined the evolutionary divergence 
between the two species of coelacanth, L. chalumnae and L. menadoensis, 
found in African and Indonesian waters, respectively. Previous ana- 
lysis of mitochondrial DNA showed a sequence identity of 96%, but 
estimated divergence times range widely from 6 to 40 Myr?°. When 
we compared the liver and testis transcriptomes of L. menadoensis” 
to the L. chalumnae genome, we found an identity of 99.73% (Sup- 
plementary Note 8 and Supplementary Fig. 7), whereas alignments 
between 20 sequenced L. menadoensis bacterial artificial chromosomes 
(BACs) and the L. chalumnae genome showed an identity of 98.7% 
(Supplementary Table 11 and Supplementary Fig. 8). Both the genic 
and genomic divergence rates are similar to those seen between the 
human and chimpanzee genomes (99.5% and 98.8%, respectively; 
divergence time of 6 to 8 Myr ago)**, whereas the rates of molecular 
evolution in Latimeria are probably affected by several factors, includ- 
ing the slower substitution rate seen in coelacanth. This suggests a 
slightly longer divergence time for the two coelacanth species. 


The adaptation of vertebrates to land 
As the species with a sequenced genome closest to our most recent 
aquatic ancestor, the coelacanth provides a unique opportunity to 
identify genomic changes that were associated with the successful 
adaptation of vertebrates to the land environment. 

Over the 400 Myr that vertebrates have lived on land, some genes 
that are unnecessary for existence in their new environment have been 
eliminated. To understand this aspect of the water-to-land transition, 
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we surveyed the Latimeria genome annotations to identify genes that 
were present in the last common ancestor of all bony fish (including 
the coelacanth) but that are missing from tetrapod genomes. More 
than 50 such genes, including components of fibroblast growth factor 
(FGF) signalling, TGF-B and bone morphogenic protein (BMP) sig- 
nalling, and WNT signalling pathways, as well as many transcription 
factor genes, were inferred to be lost based on the coelacanth data 
(Supplementary Data 7 and Supplementary Fig. 9). Previous studies of 
genes that were lost in this transition could only compare teleost fish 
to tetrapods, meaning that differences in gene content could have 
been due to loss in the tetrapod or in the lobe-finned fish lineages. 
We were able to confirm that four genes that were shown previously to 
be absent in tetrapods (And1 and And2 (ref. 29), Fef24 (ref. 30) and 
Asip2 (ref. 31)), were indeed present and intact in Latimeria, support- 
ing the idea that they were lost in the tetrapod lineage. 

We functionally annotated more than 50 genes lost in tetrapods 
using zebrafish data (gene expression, knock-downs and knockouts). 
Many genes were classified in important developmental categories 
(Supplementary Data 7): fin development (13 genes); otolith and 
ear development (8 genes); kidney development (7 genes); trunk, 
somite and tail development (11 genes); eye (13 genes); and brain 
development (23 genes). This implies that critical characters in the 
morphological transition from water to land (for example, fin-to-limb 
transition and remodelling of the ear) are reflected in the loss of 
specific genes along the phylogenetic branch leading to tetrapods. 
However, homeobox genes, which are responsible for the develop- 
ment of an organism’s basic body plan, show only slight differences 
between Latimeria, ray-finned fish and tetrapods; it would seem that 
the protein-coding portion of this gene family, along with several 
others (Supplementary Note 9, Supplementary Tables 12-16 and Sup- 
plementary Fig. 10), have remained largely conserved during the 
vertebrate land transition (Supplementary Fig. 11). 

As vertebrates transitioned to a new land environment, changes 
occurred not only in gene content but also in the regulation of existing 
genes. Conserved non-coding elements (CNEs) are strong candidates 
for gene regulatory elements. They can act as promoters, enhancers, 
repressors and insulators***’, and have been implicated as major faci- 
litators of evolutionary change”. To identify CNEs that originated in 
the most recent common ancestor of tetrapods, we predicted CNEs 
that evolved in various bony vertebrate (that is, ray-finned fish, coela- 
canth and tetrapod) lineages and assigned them to their likely branch 
points of origin. To detect CNEs, conserved sequences in the human 
genome were identified using MULTIZ alignments of bony vertebrate 
genomes, and then known protein-coding sequences, untranslated 
regions (UTRs) and known RNA genes were excluded. Our ana- 
lysis identified 44,200 ancestral tetrapod CNEs that originated after 
the divergence of the coelacanth lineage. They represent 6% of the 
739,597 CNEs that are under constraint in the bony vertebrate lin- 
eage. We compared the ancestral tetrapod CNEs to mouse embryo 
ChIP-seq (chromatin immunoprecipitation followed by sequencing) 
data obtained using antibodies against p300, a transcriptional coacti- 
vator. This resulted in a sevenfold enrichment in the p300 binding 
sites for our candidate CNEs and confirmed that these CNEs are 
indeed enriched for gene regulatory elements. 

Each tetrapod CNE was assigned to the gene whose transcription 
start site was closest, and gene-ontology category enrichment was cal- 
culated for those genes. The most enriched categories were involved 
with smell perception (for example, sensory perception of smell, 
detection of chemical stimulus and olfactory receptor activity). This 
is consistent with the notable expansion of olfactory receptor family 
genes in tetrapods compared with teleosts, and may reflect the neces- 
sity of a more tightly regulated, larger and more diverse repertoire of 
olfactory receptors for detecting airborne odorants as part of the 
terrestrial lifestyle. Other significant categories include morphoge- 
nesis (radial pattern formation, hind limb morphogenesis, kidney mor- 
phogenesis) and cell differentiation (endothelial cell fate commitment, 
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epithelial cell fate commitment), which is consistent with the body- 
plan changes required for land transition, as well as immunoglobulin 
VDJ recombination, which reflects the presumed response differences 
required to address the novel pathogens that vertebrates would encoun- 
ter on land (Supplementary Note 10 and Supplementary Tables 17-24). 

A major innovation of tetrapods is the evolution of limbs charac- 
terized by digits. The limb skeleton consists of a stylopod (humerus or 
femur), the zeugopod (radius and ulna, or tibia and fibula), and an 
autopod (wrist or ankle, and digits). There are two major hypotheses 
about the origins of the autopod; that it was a novel feature of tetra- 
pods, and that it has antecedents in the fins of fish** (Supplementary 
Note 11 and Supplementary Fig. 12). We examine here the Hox 
regulation of limb development in ray-finned fish, coelacanth and 
tetrapods to address these hypotheses. 

In mouse, late-phase digit enhancers are located in a gene desert 
that is proximal to the HOX-D cluster*®. Here we provide an align- 
ment of the HOX-D centromeric gene desert of coelacanth with those 
of tetrapods and ray-finned fishes (Fig. 2a). Among the six cis-regulatory 
sequences previously identified in this gene desert*’, three sequences 
show sequence conservation restricted to tetrapods (Supplementary 
Fig. 13). However, one regulatory sequence (island 1) is shared by tetra- 
pods and coelacanth, but not by ray-finned fish (Fig. 2b and Supplemen- 
tary Fig. 14). When tested in a transient transgenic assay in mouse, the 
coelacanth sequence of island 1 was able to drive reporter expression ina 
limb-specific pattern (Fig. 2c). This suggests that island 1 was a lobe- 
fin developmental enhancer in the fish ancestor of tetrapods that was 
then coopted into the autopod enhancer of modern tetrapods. In this 
case, the autopod developmental regulation was derived from an ances- 
tral lobe-finned fish regulatory element. 

Changes in the urea cycle provide an illuminating example of the 
adaptations associated with transition to land. Excretion of nitrogen is 
a major physiological challenge for terrestrial vertebrates. In aquatic 
environments, the primary nitrogenous waste product is ammonia, 
which is readily diluted by surrounding water before it reaches toxic 
levels, but on land, less toxic substances such as urea or uric acid must 
be produced instead (Supplementary Fig. 15). The widespread and 
almost exclusive occurrence of urea excretion in amphibians, some 
turtles and mammals has led to the hypothesis that the use of urea as 
the main nitrogenous waste product was a key innovation in the 
vertebrate transition from water to land’’. 

With the availability of gene sequences from coelacanth and lungfish, 
it became possible to test this hypothesis. We used a branch-site model 
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in the HYPHY package”, which estimates the ratio of synonymous (dS) 
to non-synonymous (dN) substitutions (@ values) among different 
branches and among different sites (codons) across a multiple-species 
sequence alignment. For the rate-limiting enzyme of the hepatic urea 
cycle, carbamoyl phosphate synthase I (CPS1), only one branch of the 
tree shows a strong signature of selection (P = 0.02), namely the branch 
leading to tetrapods and the branch leading to amniotes (Fig. 3); no 
other enzymes in this cycle showed a signature of selection. Conversely, 
mitochondrial arginase (ARG2), which produces extrahepatic urea as a 
byproduct of arginine metabolism but is not involved in the production 
of urea for nitrogenous waste disposal, did not show any evidence of 
selection in vertebrates (Supplementary Fig. 16). This leads us to con- 
clude that adaptive evolution occurred in the hepatic urea cycle during 
the vertebrate land transition. In addition, it is interesting to note that 
of the five amino acids of CPS1 that changed between coelacanth and 
tetrapods, three are in important domains (the two ATP-binding sites 
and the subunit interaction domain) and a fourth is known to cause a 
malfunctioning enzyme in human patients if mutated”. 

The adaptation to a terrestrial lifestyle necessitated major changes in 
the physiological environment of the developing embryo and fetus, 
resulting in the evolution and specialization of extra-embryonic mem- 
branes of the amniote mammals”. In particular, the placenta is a com- 
plex structure that is critical for providing gas and nutrient exchange 
between mother and fetus, and is also a major site of haematopoiesis”’. 

We have identified a region of the coelacanth HOX-A cluster that 
may have been involved in the evolution of extra-embryonic struc- 
tures in tetrapods, including the eutherian placenta. Global alignment 
of the coelacanth Hoxal4-Hoxal3 region with the homologous 
regions of the horn shark, chicken, human and mouse revealed a 
CNE just upstream of the coelacanth Hoxal4 gene (Supplementary 
Fig. 17a). This conserved stretch is not found in teleost fishes but is 
highly conserved among horn shark, chicken, human and mouse 
despite the fact that the chicken, human and mouse have no Hoxal4 
orthologues, and that the horn shark Hoxa14 gene has become a pseu- 
dogene. This CNE, HA14E1, corresponds to the proximal promoter- 
enhancer region of the Hoxal4 gene in Latimeria. HA14E1 is more 
than 99% identical between mouse, human and all other sequenced 
mammals, and would therefore be considered to be an ultra-conserved 
element’. The high level of conservation suggests that this element, 
which already possessed promoter activity, may have been coopted for 
other functions despite the loss of the Hoxal4 gene in amniotes 
(Supplementary Fig. 17bc). Expression of human HA14E]1 in a mouse 
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indicated by arrowheads in the first two 
panels. The third panel shows a close- 
up of a limb bud. 
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Figure 3 | Phylogeny of Cps1 coding sequences is used to determine positive 
selection within the urea cycle. Branch lengths are scaled to the expected 
number of substitutions per nucleotide, and branch colours indicate the 
strength of selection (dN/dS or @). Red, positive or diversifying selection 

(@ > 5); blue, purifying selection (@ = 0); yellow, neutral evolution (@ = 1). 
Thick branches indicate statistical support for evolution under episodic 
diversifying selection. The proportion of each colour represents the fraction of 
the sequence undergoing the corresponding class of selection. 


transient transgenic assay did not give notable expression in the embryo 
proper at day 11.5 (information is available online at the VISTA en- 
hancer browser website; http://enhancer.lbl.gov/cgi-bin/imagedb3.pl? 
form=presentation&show= 1 &experiment_id=501&organism_id=1), 
which was unexpected as its location would predict that it would regu- 
late axial structures caudally**. A similar experiment in chick embryos 
using the chicken HA14E] also showed no activity in the anteroposter- 
ior axis. However, strong expression was observed in the extraem- 
bryonic area vasculosa of the chick embryo (Fig. 4a). Examination of a 
Latimeria BAC Hoxa14-reporter transgene in mouse embryos showed. 
that the Hoxal4 gene is specifically expressed in a subset of cells in an 
extra-embryonic region at embryonic day 8.5 (Fig. 4b). 

These findings suggest that the HA14E1 region may have been 
evolutionarily recruited to coordinate regulation of posterior HOX- 
A genes (Hoxal3, Hoxall and Hoxal0), which are known to be 
expressed in the mouse allantois and are critical for early formation 
of the mammalian placenta. Although Latimeria does not possess a 
placenta, it gives birth to live young and has very large, vascularised 
eggs, but the relationship between Hoxa14, the HA14E1 enhancer and 
blood island formation in the coelacanth remains unknown. 


The coelacanth lacks immunoglobulin-M 


Immunoglobulin-M (IgM), a class of antibodies, has been reported in 
all vertebrate species that have been characterized so far, and is con- 
sidered to be indispensable for adaptive immunity”. Interestingly, 
IgM genes cannot be found in coelacanth, despite an exhaustive search 
of the coelacanth sequence data, and even though all other major com- 
ponents of the immune system are present (Supplementary Note 12 
and Supplementary Fig. 18). Instead, we found two IgW genes (Sup- 
plementary Figs 19-21); immunoglobulin genes that are found only 
in lungfish and cartilaginous fish and are believed to have originated in 
the ancestor of jawed vertebrates** but subsequently lost in teleosts and 
tetrapods. IgM was similarly absent from the Latimeria RNA-seq data, 
although both IgW genes were found as transcripts. To characterize 
further the apparent absence of IgM, we screened large genomic L. 
menadoensis libraries exhaustively using a number of strategies and 
probes. We also carried out polymerase chain reaction (PCR) with 
degenerate primers that should universally amplify IgM sequences. 
The lack of IgM in Latimeria raises questions as to how coelacanth 
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Figure 4 | Transgenic analysis implicates involvement of Hox CNE HAI4E1 
in extraembryonic activities in the chick and mouse. a, Chicken HAI4E1 
drives reporter expression in blood islands in chick embryos. A construct 
containing chicken HA14E1 upstream of a minimal (thymidine kinase) 
promoter driving enhanced green fluorescent protein (eGFP) was 
electroporated in HH4-stage chick embryos together with a nuclear mCherry 
construct. GFP expression was analysed at stage approximately HH11. The 
green aggregations and punctate staining are observed in the blood islands and 
developing vasculature. b, Expression of Latimeria Hoxa14-reporter transgene 
in the developing placental labyrinth of a mouse embryo. A field of cells from 
the labyrinth region of an embryo at embryonic day 8.5 from a BAC transgenic 
line containing coelacanth Hoxa9-Hoxa14 (ref. 49) in which the Hoxal4 gene 
had been supplanted with the gene for red fluorescence protein (RFP). 
Immunohistochemistry was used to detect RFP (brown staining in a small 
number of cells). 


B cells respond to microbial pathogens and whether the IgW molecules 
can serve a compensatory function, even though there is no indication 
that the coelacanth IgW was derived from vertebrate IgM genes. 


Discussion 


Since its discovery, the coelacanth has been referred to as a ‘living 
fossil’, owing to its morphological similarities to its fossil ancestors’. 
However, questions have remained as to whether it is indeed evolving 
slowly, as morphological stasis does not necessarily imply genomic 
stasis. In this study, we have confirmed that the protein-coding genes 
of L. chalumnae show a decreased substitution rate compared to those 
of other sequenced vertebrates, even though its genome as a whole 
does not show evidence of low genome plasticity. The reason for this 
lower substitution rate is still unknown, although a static habitat anda 
lack of predation over evolutionary timescales could be contributing 
factors to a lower need for adaptation. A closer examination of gene 
families that show either unusually high or low levels of directional 
selection indicative of adaptation in the coelacanth may provide 
information on which selective pressures acted, and which pressures 
did not act, to shape this evolutionary relict (Supplementary Note 13 
and Supplementary Fig. 22). 

The vertebrate land transition is one of the most important steps in 
our evolutionary history. We conclude that the closest living fish to 
the tetrapod ancestor is the lungfish, not the coelacanth. However, the 
coelacanth is critical to our understanding of this transition, as the 
lungfish have intractable genome sizes (estimated at 50-100 Gb)*’. 
Here we have examined vertebrate adaptation to land through co- 
elacanth whole-genome analysis, and have shown the potential of 
focused analysis of specific gene families involved in this process. 
Further study of these changes between tetrapods and the coelacanth 
may provide important insights into how a complex organism like a 
vertebrate can markedly change its way of life. 


METHODS SUMMARY 

A full description of methods, including information on sample collection, 
sequencing, assembly, annotation, all sequence analysis and functional valid- 
ation, can be found in the Supplementary Information. 
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Gating of the TrKH ion channel by its 
associated RCK protein TrkA 


Yu Cao!*, Yaping Pan>?*, Hua Huang'**, Xiangshu Jin’, Elena J. Levin'?, Brian Kloss* & Ming Zhou? 


TrkH belongs to a superfamily of K* transport proteins required for growth of bacteria in low external K* 
concentrations. The crystal structure of TrkH from Vibrio parahaemolyticus showed that TrkH resembles a K* 
channel and may have a gating mechanism substantially different from K* channels. TrkH assembles with TrkA, a 
cytosolic protein comprising two RCK (regulate the conductance of K*) domains, which are found in certain K* 
channels and control their gating. However, fundamental questions on whether TrkH is an ion channel and how it is 
regulated by TrkA remain unresolved. Here we show single-channel activity of TrkH that is upregulated by ATP via 
TrkA. We report two structures of the tetrameric TrkA ring, one in complex with TrkH and one in isolation, in which the 
ring assumes two markedly different conformations. These results suggest a mechanism for how ATP increases TrkH 


activity by inducing conformational changes in TrkA. 


K* is concentrated in all living cells and is essential for many physio- 
logical processes. In bacteria, homeostasis of K* is largely mediated by 
specialized K* transport proteins known as the superfamily of K 
transport (SKT) proteins’. The bacterial TrkH, TrkG and KtrB pro- 
teins form the largest subfamily of SKT proteins, and the crystal 
structure of TrkH from Vibrio parahaemolyticus, VpTrkH, was 
reported recently’. Each of the four connected homologous domains 
of TrkH resembles a subunit of the KcsA K* channel, comprising two 
transmembrane helices connected by a re-entrant P-loop (M1-P- 
M2)’, and the four domains encircle a central ion permeation path- 
way. In addition, TrkH has two structural features atypical of K* 
channels: a dimeric quaternary structure, with each protomer con- 
taining its own pore; and a long membrane-embedded loop, inserted 
into the middle of a pore-lining helix, that occludes the ion permea- 
tion pathway. The loop is conserved across the TrkH/TrkG/KtrB 
family, and previous studies have shown that deletion of this intra- 
membrane loop increases KtrB-mediated K* uptake, indicating that 
it may act as a gate*”. 

The TrkH, TrkG and KtrB integral membrane proteins form a 
complex with an intracellular regulatory protein, and the genes 
encoding the membrane protein and the regulatory protein reside 
in the same operon in many bacteria. The protein that assembles with 
TrkH and TrkG is TrkA, whereas KtrB interacts with KtrA°*. Both 
TrkA and KtrA are composed of RCK domains"'®, which can be 
classified into two types based on their cognate ligands: bacterial 
RCK domains possessing conserved GXGXXG motifs that bind to 
nucleotides*!”'*!”!8, sometimes referred to as KTN domains, and 
the bacterial and mammalian RCKs that lack a nucleotide-binding 
motif and bind to ions such as Ca?" (refs 11, 15, 16, 19). The organ- 
ization of RCK domains varies considerably. For example, the KtrA 
and TrkA protomers contain one and two RCK domains, respectively, 
expressed as separate cytoplasmic proteins, whereas for K* channels 
the RCK domains are directly connected to the termini of their assoc- 
iated channels. Previous RCK structures suggest that regardless of 
gene architecture, a total of eight RCK domains associate in a config- 
uration known as a gating ring®!**'°"". 


The TrkH, TrkG and KtrB proteins were initially thought to be 
either H*- or Na -dependent K* transporters”’”, but their sequence 
and structural similarity to K* channels suggests that these proteins 
could function as ion channels. However, ion channel activity has 
never been demonstrated for any member of the TrkH/TrkG/KtrB 
family of proteins. Furthermore, if TrkH is an ion channel, what effect 
does its associated protein TrkA have on gating of the channel? 
Current models of gating in K* channels with RCK domains propose 
that a dilation in the diameter of the gating ring directly translates into 
movement of the four pore-lining helices and opening of the permea- 
tion pathway”'?>'°?, However, this model of a four-fold symmetric 
gating ring expansion seems to be incompatible with the dimeric archi- 
tecture of TrkH, and does not account for the intramembrane loop. In 
this study, we attempt to address these unresolved issues by character- 
izing the structure and function of the TrkKH-TrkA complex. 


Single-channel activities of the TrkKH-TrkA complex 


Patch clamp was performed on Escherichia coli spheroplasts hetero- 
logously expressing TrkH and TrkA from V. parahaemolyticus using 
an inside-out configuration. A single-channel current trace in 200 mM 
symmetrical K* solution is shown in Fig. la. Occasional bursts of 
channel activities were observed, and these bursts were interrupted by 
prolonged closures. Within each burst of openings there are two current 
levels: level 1 and 2, with level 2 approximately double the size of level 1 
in amplitude (Supplementary Table 1). Every burst of openings starts 
with a sojourn from the baseline directly to level 2 followed by openings 
alternating between levels 1 and 2, and terminates >91% of the time 
directly from level 2. This behaviour is inconsistent with two entirely 
independent channels, but strongly suggests a single TrkH dimer, the 
two pores of which open and close in a mostly concerted fashion. 
Asacontrol, currents from spheroplasts expressing only TrkH were 
also recorded. Similar bursts of channel activities with two levels of 
amplitudes were observed (Supplementary Table 1 and Supplementary 
Fig. 1a). However, the overall open probability of TrkH without TrkA 
is higher than that of the complex (0.65 + 0.03 versus 0.17 + 0.01), 
suggesting that TrkA affects the activity of TrkH. To verify that the 
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Figure 1 | Single-channel activities of the TrkH-TrkA complex. a, Single- 
channel currents through the TrkH-TrkA complex were recorded in 
symmetrical 200 mM KCI. The current trace is shown at a holding potential of 
+50 mV. The closed state is indicated as ‘C’. The grey rectangles show two 
individual bursts of channel openings. b, The open-channel current-voltage 
relationship. Current traces recorded at different voltages are shown in the left 
panel, and current amplitudes are plotted in the right panel. c, Current traces of 


single-channel currents were from TrkH, a cysteine substitution was 
made to a pore-lining residue (1220C) based on the known TrkH 
structure’. Perfusion of the cysteine-modifying reagent (2-(trimethy- 
lammonium)ethyl) methanethiosulphonate (MTSET)* reversibly 
reduced current amplitudes as well as the overall open probability 
for the 1220C mutant for either TrkH alone or the TrkH-TrkA com- 
plex (Supplementary Fig. 1b, c). Wild-type TrkH was not affected by 
MTSET (Supplementary Fig. 1c). 

The open-channel current-voltage (J-V plot) relationship is plot- 
ted in Fig. 1b and shows slight inward rectification: level 2 has a chord 
conductance of 350 pS at +60 mV and 550 pS at —60 mV. Selectivity 
of the TrkH ion channel was examined by changing the solution on 
the intracellular side of the patch. Although TrkH does not discri- 
minate Rb* and Cs* from K“, it conducts Na’ and Li* ata signifi- 
cantly slower rate (Supplementary Table 1 and Supplementary Fig. 2). 
Given that the K* channel signature sequence, TVGYG, which lines 
the selectivity filter, is not highly conserved in TrkH, the weak selecti- 
vity of TrkH is not surprising. In addition, substituting most of the 
intracellular Cl” with a large organic anion, gluconate, does not affect 
the reversal potential, indicating that the TrkH channel does not 
conduct Cl” (Supplementary Fig. 2d). 

Because TrkA contains a conserved nucleotide-binding motif, we 
examined channel activity in the presence of NAD*, NADH and 
ATP. Whereas perfusion of NADs to the intracellular side of the 
channel did not produce significant changes in the currents, perfusion 
of 5mM ATP substantially increased the open probability (Popen) of 
the TrkH-TrkA complex (Fig. lc and Supplementary Fig. 3). The 
increase in Popen is the result of both a reduced time between bursts 
and a prolonged burst duration (Fig. 1c). To verify that this effect was 
mediated by TrkA, we mutated R100 on TrkA, which forms part of 
the ATP-binding site (see ‘Structure of the isolated TrkA tetramer’ 
below), to alanine, and the mutant no longer responds to ATP with 
an increase in Popen (Fig. 1c). This increase in Popen does not require 
ATP hydrolysis, because two non-hydrolysable ATP analogues, 
adenosine 5'-O-(3-thiotriphosphate) (ATP-yS) and adenosine 5’-(f, 
y-imidotriphosphate) (AMP-PNP), both produced a similar increase 
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the TrkH-TrkA (wild type (WT)-WT) complex and the TrkH-TrkA (WT- 
R100A) complex in the absence (left) and presence (right) of 5 mM ATP or 
ADP. The Popen after perfusion of different ligands is plotted in the bar graph. 
Added ligand concentrations are as follows: ATP, 5 mM; ATP-yS, 1 mM; AMP- 
PNP, 1mM; ADP, 5mM; AMP, 5mM; NADH, 1 mM; NAD“, | mM. Error 
bars are s.e.m. of 3-9 independent measurements. 


in Popen (Fig. 1c). This result is consistent with earlier studies indi- 
cating that the Trk system in bacteria requires ATP** but is not an 
ATPase”. Notably, ADP had the opposite effect to that of ATP and 
significantly reduced the Poyen, whereas AMP had a negligible effect 
(Fig. 1c and Supplementary Fig. 3). 


Structure of the TrkH-TrkA complex 


To understand how TrkH and TrkA interact, we co-expressed the two 
proteins and crystallized the stable purified complex (Supplementary 
Fig. 4). Although NADH does not affect channel activity (Fig. 1c), 
inclusion of NADH in the crystallization solution improved crystal 
size and quality. The structure of the complex was solved at a resolution 
of 3.8A (Supplementary Fig. 5 and Supplementary Table 2). In the 
asymmetric unit, two TrkH dimers interact with a ring of four TrkA 
protomers; this probably occurs because the internal symmetry of the 
TrkA tetramer creates two equivalent interfaces on both sides of the 
ring, as discussed in detail below. In the membrane, one TrkH dimer 
would interact with one TrkA tetramer (Supplementary Fig. 6). 

The tetrameric TrkA gating ring associates with the cytoplasmic face 
of the TrkH dimer (Fig. 2a, b). The structure of TrkH in the complex is 
almost identical to the previously solved structure of TrkH in isolation 
(Supplementary Fig. 7a). In particular, the pore of the ion conduction 
pathway is still obstructed by the intramembrane loop (Fig. 2b inset 
and Supplementary Fig. 7b). Each TrkKH monomer interacts with TrkA 
through a cytosolic loop following the second half of the broken M2 
helix in domain 3, D3M2b (Fig. 2b and Supplementary Fig. 4), which is 
tilted to lie nearly parallel to the bilayer and is connected directly to the 
intramembrane loop. The interface includes a number of salt bridges 
between positively charged residues on the D3M2b-D4 loop and nega- 
tively charged residues on TrkA (Supplementary Fig. 8b). A number of 
residues on the cytosolic side of TrkH are missing from the electron 
density, including an 18-residue stretch on the loop connecting 
domains 1 and 2 (Supplementary Fig. 8a). Although this cytosolic loop 
is not highly conserved among the TrkG, TrkH and KtrB proteins, 
because of its proximity to TrkA it may be involved in additional 
contacts not resolved in the crystal structure. 
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Figure 2 | Structure of the TrkH-TrkA complex. a, Structure of the complex 
between the TrkH dimer (light blue and teal) and the TrkA tetramer (dark blue 
and purple), viewed from the periplasmic side. The TrkA protomer is shown 
as a surface representation. b, The TrkKH-TrkA complex viewed from within 

the plane of the membrane. Helix D3M2b in each TrkH subunit is highlighted 
in red. The inset on the right shows homologous domains 1 and 3 of TrkH, with 
the intramembrane loop and helix D3M2b highlighted. c, Schematic showing 
the organization of RCK domains ina TrkA subunit. d, e, Cartoon representation 


Fold and quaternary structure of TrkA RCK gating ring 


The TrkA protomer contains two tandem RCK domains, RCK1 and 
RCKz2, so that the gating ring contains a total of eight RCK domains. 
Each RCK domain comprises an amino-terminal lobe assuming a 
Rossman fold, connected to a smaller carboxy-terminal lobe by a 
‘hinge’ helix (Fig. 2c, d). A key difference between the TrkA structure 
and the BK and MthK channel-associated RCK structures''!*"° is 
that the TrkA tetramer has D, point group symmetry, involving three 
perpendicular, intersecting, two-fold rotational symmetry axes 
(Fig. 2d, f), whereas the BK gating ring has Cy, symmetry (Fig. 2e). 
As a consequence of its ‘dimer of dimers’ arrangement, the TrkA 
tetramer provides a two-fold symmetric interaction surface to match 
the two-fold symmetry of the TrkH dimer. Additionally, the tetramer 
contains two distinct types of subunit interfaces: N1-N1 interfaces, 
formed between the N-terminal lobes of the RCK1 domains; and 
N2-N2 interfaces, formed by the N-terminal lobes of the RCK2 
domains. In contrast, the four-fold symmetric BK gating ring contains 
four identical N1-N2 interfaces. 

The TrkA protomer contains two GXGXXG nucleotide-binding 
motifs”, one in the N-lobe of each RCK domain. The N2 lobe binding 
site in each TrkA protomer is occupied by a large electron density 
consistent with NADH, which was included as an additive to facilitate 
crystallization. At this low resolution the oxidized and reduced forms 
cannot be distinguished (Supplementary Fig. 9). The N1 lobe binding 
site is unoccupied. 


Structure of the isolated TrkA tetramer 

We were also able to obtain crystals of TrkA without TrkH and in the 
presence of 10 mM ATP-7S that diffracted to a resolution of 3.05 A, 
and solved the structure by molecular replacement (Fig. 3a and 
Supplementary Table 2). Isolated TrkA forms a tetramer in solution 
(Supplementary Fig. 10) as well as in the crystal lattice (Supplemen- 
tary Fig. 11). Each protomer contains two large densities consistent 
with ATP-7S molecules, with one bound to the GXGXXG motif in the 
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of a TrkA protomer (d) or a protomer from the BK channel gating ring (e), 
coloured by subdomain according to the colour scheme in panel c. Schematics 
illustrating the organization of the tetrameric gating ring with the two-fold or 
four-fold symmetry axes marked are shown on the right. The green outlines 
delineate an individual subunit in each tetramer. f, Surface representation of 
the TrkA tetramer in the TrkKH-TrkA complex, viewed from the periplasmic 
side (top) or parallel to the membrane (bottom). The green outline delineates 
an individual subunit. 


N lobe of each RCK domain, so that the gating ring contains a total of 
eight bound nucleotides (Supplementary Fig. 12). 

Comparison of the structure of isolated TrkA to the structure from 
the TrkH-TrkA complex shows a substantial conformational change 
(Fig. 3a). This change is generated by markedly different behaviour of 
the N1-N1 and N2-N2 interfaces. The N lobes of the N2-N2 interface 
rotate and slide past each other (Fig. 3b and Supplementary Video 1), 
with one N2-N2 interface undergoing a larger translation than the 
other (Supplementary Fig. 13 and Supplementary Video 2). In con- 
trast, the N1-N1 interface shows no change between the two structures 
(Fig. 3c and Supplementary Video 3). The different behaviour of the 
N1-NI1 and N2-N2 interfaces converts TrkA from a roughly planar 
conformation to a twisted conformation (Fig. 3a). 

The conformational change within individual protomers driving the 
‘flat’ to ‘twisted’ transition in the gating ring can be visualized by align- 
ing protomers from the ATP-yS-bound structure and the complex by 
their immobile N1 lobes only, which reveals rigid-body rotations of the 
N2, Cl and C2 lobes about the hinge helix connecting N1 to Cl (Fig. 3d 
and Supplementary Video 4). Given that the two structures were 
obtained at low resolution and bound to non-physiological ligands, it 
is difficult to identify specific protein-ligand interactions that may 
drive this conformational change. However, we note that the y-phos- 
phate groups on the bound ATP-7S molecules are located very close to 
the cleft between the two N lobes (Fig. 3e and Supplementary Fig. 13). 
The width of this cleft changes between the two structures, as measured 
by a change in the intersection angle of the hinge helices from 120° in 
the TrkH-TrkA structure to approximately 100° in the ATP-yS-bound 
TrkA structure (Fig. 3e). 

The change in the angle of intersection between hinge helices 
(Fig. 3e) is not unique to TrkA; similar movements have been 
observed in the KefC’*"*, KtrA™ and MthK”’ gating rings. In contrast, 
the interfaces corresponding to the intersubunit interfaces in TrkA 
(referred to as ‘assembly’ interfaces in MthK and BK channels) seem 
to vary in their behaviour between the different gating rings. In BK 
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Figure 3 | Comparison of TrkA in the TrkA-only and TrkH-TrkA complex 
structures. a, Surface representation of the isolated TrkA tetramer, viewed 
from the periplasmic side (top) or parallel to the membrane (bottom), coloured 
by subdomain according to the colour scheme in Fig. 2c. b, c, Superposition of 
the N2-N2 interface (b) and N1-N1 interface (c) in the TrkKH-TrkA (grey) and 
TrkA-only (blue or purple) structures. The alignment was based on the 
domains coloured black in both structures. d, A single TrkA protomer from the 
TrkH-TrkA complex structure (left) and a protomer from the TrkA-only 
structure (right) viewed from the same orientation after superposition by their 


and MthK channels, all four assembly interfaces are fixed, forcing the 
ring to accommodate the change in the angle of the N-lobe cleft 
through a uniform, four-fold symmetric dilation. In KtrA and TrkA, 
two of the four assembly interfaces are able to move relative to each 
other, resulting in a conformational change that is two-fold symmetric 
(Fig. 3f). It may be that the change in the angle of the cleft between the 
NI1-N2 lobes represents a conserved response of RCK domains to 
ligand binding that introduces a strain on the gating ring, and that 
differences in the architecture and strength of the intersubunit or 
assembly interfaces determines how this strain is relieved, resulting 
in an overall conformational change to the ring that is suitable for 
the particular associated channel. 


Mechanistic implications for channel regulation by TrkA 


Comparison of the two TrkA conformations suggests a possible 
mechanism for opening of the channel by nucleotide-induced con- 
formational changes on the gating ring. In the closed state bound to 
ADP, probably corresponding to the structure of the TrkKH-TrkA 
complex, the ion permeation pathway is blocked by the intramem- 
brane loop connected to helix D3M2b. Although the ‘twisted’ con- 
formation of TrkA was obtained in the absence of the channel, 
possible effects of this conformational change on the channel can 
be probed by mapping the locations of residues forming the interface 
in the TrkH-TrkA structure onto the TrkA-only structure (Fig. 4a). 
The surface residues interacting with the channel are located prim- 
arily in two patches on the diagonally positioned N2 domains facing 
the channel, each of which interacts with one of the channel D3M2b 
helices. These two interface regions are farther apart by ~24 A in the 
isolated TrkA structure (Fig. 4a and Supplementary Video 5). If ATP 
binding causes the gating ring to adopt a twisted conformation similar 
to the free TrkA structure, separation of the two interfaces on the 
tetramer could pull the helix D3M2b away from the centre of the 
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N1 subdomains. N1 is shown as surface representation, whereas the other three 
lobes are shown as cartoon representation. e, The N1 and N2 lobes in a subunit 
of TrkA from the NADH-bound TrkH-TrkA complex (top) and the ATP-yS- 
bound TrkA (bottom) structures are shown with their hinge helices 
highlighted. NADH or ATP-yS are shown in green. f, Cartoons of the TrkA, BK 
and KtrA gating rings showing only the N lobes. The hinge helices are marked 
with red rectangles. Description of interfaces as ‘fixed’ or ‘rotating’ is based on 
comparison of structures 2HMS and 2HMW (Protein Data Bank accessions) 
for KtrA, and 3NAF and 3U6N (Protein Data Bank accessions) for BK. 


ion permeation pathway, leading to movement of the loop out of 
the pore and opening of the channel (Fig. 4b). As a first test of whether 
the intramembrane loop is in fact required for the complex’s response 
to ATP, currents were measured from the complex of the wild-type 
TrkA and a TrkH mutant with residues 348-354 of the intramem- 
brane loop deleted. The loop deletion mutant had a higher basal open 
probability than the wild-type channel, and was largely insensitive to 
perfusion of ATP (Fig. 4c). Although this result is consistent with the 
hypothesized role of the intramembrane loop in gating of TrkH by 
TrkA, further structural and functional studies are necessary to reveal 
the molecular mechanism governing regulation of channel activity. 


Discussion 

This study presents, to our knowledge, the first demonstration that 
TrkH, and by extrapolation the TrkG and KtrB family of proteins, are 
ion channels. Additionally, we demonstrate that ion flux through the 
TrkH-TrkA complex is upregulated by ATP and downregulated by 
ADP. It is likely that in a cell the ratio of ATP to ADP determines the 
activity of TrkH, thus providing a way for regulating ion conduction 
in response to the metabolic activity of an organism. However, it is 
puzzling that a protein that promotes growth of bacteria in low 
external K* concentrations only has a modest selectivity between 
K* and Na’. 

One curious feature of the Na* and Li* currents is that the ampli- 
tude of level 2 was not twice that of level 1. Similar behaviour was 
observed for the 1220C channels modified by MTSET (Supplementary 
Figs 1, 2 and Supplementary Table 1). One possible interpretation of 
this result is that under these conditions the conductance through an 
individual channel is lower when its partner is closed than when both 
are open. Because the D3M2b helix is connected to the intramem- 
brane loop and also forms significant contacts with the neighbouring 
protomer, it represents a possible avenue for coupling of gating in one 


©2013 Macmillan Publishers Limited. All rights reserved 


a TrkH-TrkA + NADH 


Isolated TrkA + ATP-yS 


c TrkH-TrkA (Aloop-WT) 0.9 


No ATP 
ed 0.6 
=] 
35 a‘ 
0.3 | | 
Ae 


-ATP +ATP —ATP + ATP 


By TrkH-Trka (WT-wT) 
(O TrkH-TrkA (Aloop-WT) 


open 


+ ATP 


Figure 4 | Proposed mechanism of regulation of TrkH gating by TrkA. 

a, Surface representation of the TrkA tetramer from the TrkA-only (top) and 
TrkH-TrkA structures, viewed from the membrane-facing side. Residues 
forming the channel-gating ring interface, defined as residues with at least one 
atom within 4 A of TrkH in the complex structure, are marked in yellow. 

b, Diagram illustrating a possible gating mechanism, with the closed channel 
shown on the left, and the open channel on the right. The D3M2b helix is 
represented as a red cylinder. For clarity, gating of only one TrkH protomer is 
shown. ¢, Left panel: single-channel currents through the TrkH-TrkA (Aloop- 
WT) complex before and after addition of ATP. The holding potential is 
+50 mV. Right panel: the Poyen of TrKH-TrkA (WT-WT) and TrkH-TrkA 
(Aloop-WT) before and after addition of 5 mM ATP. The error bars are s.e.m. 
from three independent patches. 


channel to conductance in its partner. However, why this behaviour 
occurs more prominently when Na™ or Li* occupies the selectivity 
filter remains unknown. 

In addition, we present two different structures of the TrkA gating ring, 
solved in complex with TrkH or in isolation. From these structures we 
have proposed a model for the gating of TrkH by TrkA that assigns the 
conformation in the complex to the closed state of TrkA, and the con- 
formation in isolation to the open state. However, although the ‘open’ 
structure was obtained bound to an analogue of the channel opener ATP, 
it should be noted that crystal packing and the absence of the channel 
probably have a significant effect on the gating ring’s energy landscape, 
and these factors rather than the identity of the bound ligand may be 
responsible for the conformation observed in the crystal structure of the 
isolated TrkA tetramer. In fact, a recently reported structure of TrkA from 
Vibrio vulnificus (Protein Data Bank accession 4G65) assumes a con- 
formation highly similar to the twisted conformation reported here des- 
pite containing no bound nucleotide. Nonetheless, although it may not 
correspond exactly to the structure in the open complex, the isolated TrkA 
structure reveals the range of flexibility accessible to the gating ring. 
Confirmation for the proposed model will require further structural 
and functional studies of the intact complex. 


ARTICLE 
METHODS SUMMARY 


Vibrio parahaemolyticus TrkKH (GenBank accession EDM57342.1) and TrkA 
(EDM59790.1) were overexpressed in BL21(DE3) cells and purified by affinity 
chromatography, followed by cleavage of the tag and a second purification step 
ona size-exclusion column. For purification of the TrkH-TrkA complex, TrkA was 
expressed with a C-terminal polyhistidine affinity tag; for the purification of isolated 
TrkA, TrkA was expressed with an N-terminal GST domain as an affinity tag. The 
TrkH-TrkA complex was solubilized in 40 mM n-decyl-B-b-maltoside, which was 
decreased to 3.5 mM during the size-exclusion chromatography step. TrkKH-TrkA 
crystals were grown by the microbatch method and derivatized with TaBr,, before 
freezing. TrkA crystals were grown in the presence of 10 mM ATP-7S by the sitting- 
drop vapour diffusion method. The structures of TrkA and the complex were 
solved using molecular replacement or combined molecular replacement/single- 
wavelength anomalous dispersion, respectively, with the PHENIX software suite”’. 

Patch-clamp recording of the TrkH-TrkA complex was performed on E. coli 
giant spheroblasts as described previously” *'. Single-channel currents were 
recorded on inside-out patches, with both the bath solution and the pipette 
solution containing 200 mM KCI, 0.5 M sucrose, and 10 mM HEPES at pH 7.4. 
The data were analysed with pClampfit9. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Cloning and purification of the TrkH-TrkA complex. The trkH gene was 
amplified from Vibrio parahaemolyticus genomic DNA and was cloned into 
the pET31b plasmid (Novagen) with an ampicillin-resistance gene. The Vibrio 
parahaemolyticus trkA gene was cloned into a modified pET plasmid (Novagen) 
with a C-terminal polyhistidine tag and a TEV protease recognition site with a 
kanamycin-resistance gene. For large-scale purification of the VpTrkH-TrkA 
complex, BL21(DE3) cells were co-transformed with a 1:1.5 ratio of plasmids 
carrying VpTrkH and VpTrkA, respectively. The transformants were grown in 
Luria broth supplemented with 100 mg]~' kanamycin and 100 mg!’ ampicillin 
at 37°C and induced with 0.5mM isopropyl -p-1-thiogalactopyranoside 
(IPTG) after the optical density at 600 nm reached 1.0. The cell membranes were 
solubilized with 40 mM n-decyl-B-p-maltoside (Anatrace) and the TrkH-TrkA 
complex was purified with TALON Metal Affinity Resin (Clontech Inc.). After 
removal of the His tag with TEV protease, the TrkH-TrkA was subjected to size- 
exclusion chromatography with a Superdex 200 10/300 GL column (GE Health 
Sciences) pre-equilibrated in a buffer of 150mM KCl, 20mM HEPES, pH7.5, 
5mM £-mercaptoethanol and 3.5mM n-decyl-B-p-maltoside. For crystalliza- 
tion, the complex was concentrated to 10 mg ml ' as approximated by ultraviolet 
absorbance and NADH was added to a final concentration of 4mM. 

Cloning and purification of isolated TrkA. For expression of VpTrkA alone, the 
VpTrkA gene was cloned into a modified pGEX-5x-1 GST expression vector (GE 
lifescience) with a TEV protease recognition site replacing the original Factor Xa 
site. The plasmid was transformed into BL21(DE3) cells, and the transformants 
were grown in Luria broth supplemented with 100mg]~' ampicillin at 37°C. 
Once the OD¢o0 nm reached 1.0, expression was induced with 0.5 mM isopropyl 
B-p-1-thiogalactopyranoside (IPTG) at 20 °C for 15 h. GST-VpTrkA was purified 
with glutathione Sepharose 4B resin (Amersham Bioscience). After removal of the 
GST with TEV protease, VpTrkA was subjected to size-exclusion chromatography 
with a Superdex 200 10/300 GL column (GE Health Sciences) pre-equilibrated in a 
buffer of 150 mM KCl, 20 mM HEPES, pH 7.5 and 5 mM f-mercaptoethanol. The 
protein was concentrated to 8mg ml ' as approximated by BCA method. 
Crystallization and structure solution. TrkKH-TrkA crystals were grown by 
microbatch crystallization under mineral oil where 1.5 pl of the protein solution 
was mixed with an equal volume of crystallization solution containing 37.5% 
PEG400, 400 mM ammonium sulphate and 100mM HEPES, pH 6.75. Before 
collecting crystals, tantalum bromide clusters (Jena Bioscience) were applied by 
dissolving approximately 25 1g pl ' powder directly into the crystal growth solu- 
tion and incubating for 10-12h. Before flash-freezing in liquid nitrogen, the 
crystals were cryo-protected by soaking for 2-5 in a solution obtained by sit- 
ting-drop vapour diffusion with equal volumes of protein solution and well 
solution containing 35% PEG400, 400mM ammonium sulphate, 100mM 
HEPES, pH 6.75 and 25 pg wl! TagBryo. 

Isolated VpTrkA was co-crystallized with ATP-yS by vapour diffusion in sit- 
ting drops, where 1.5 ul of the protein solution supplemented with 10 mM ATP- 
yS was mixed with an equal volume of crystallization solution. Crystals were 
obtained in two conditions: 10% polyacrylic acid 5100, 100 mM Tris, pH 8.0, 
20mM MgCl, and 5% glycerol; or 30% PEG400, 100 mM glycine, pH 9.0 and 
200 mM MgSO,. Before flash-freezing in liquid nitrogen, the crystals were cryo- 
protected by increasing the concentration of glycerol (25% v/v) and PEG400 (35% 
v/v) in the mother liquor. 
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X-ray diffraction data were collected at the beamlines X29 and X4A and C at 

the National Synchrotron Light Source, ID-24 and ID-17 at the Advanced Photon 
Source, 5.0.2 and 8.2.2 at the Advanced Light Source, and Al and F1 at CHESS. 
The structure of the TrkKH-TrkA complex was solved by MRSAD using the 
Phenix software suite’*, with the TrkH structure (3PJZ) as a search model and 
experimental phases from the 18 TagBr,, clusters in the asymmetric unit. Iterative 
rounds of manual and automatic structure refinement were carried out with 
Coot” and phenix.refine, respectively, and protein geometry was monitored 
using Molprobity*’. The final model contains 18 tantalum bromide clusters, 4 
NADH molecules, 4 TrkA molecules and 4 TrkH molecules. Residues 63-65 and 
159-177, located on cytoplasmic loops, are unresolved in the TrkH molecules; 
residues 162-163 and 178-181 are unresolved in TrkA. The isolated TrkA struc- 
ture was solved using molecular replacement with the TrkA structure from the 
complex, and refined by a similar method to the structure of the complex. The 
final model of the isolated TrkA structure contained four molecules of ATP-yS 
and two TrkA molecules. 
Electrophysiology. The VpTrkH- and VpTrkA-containing vectors used in 
expressing the TrkH-TrkA complex were used. All mutations were made with 
the QuikChange kit (Agilent Technologies) and verified by sequencing through 
the entire coding regions. Giant spheroplasts were prepared from E. coli following 
a protocol described previously””*’, with minor modifications. In brief, a single 
colony co-transformed with TrkH and TrkA was incubated in 5 ml of modified 
LB medium (0.5% NaCl, 1% tryptone and 0.5% yeast extract) by shaking at 
250 r.p.m., 37°C. When the ODgoo reached 0.5, the culture was diluted by 10- 
fold into 5 ml fresh modified LB with the addition of 60 pg ml * cephalexin. The 
elongated filaments were then collected by centrifuging 2 ml of the culture at 
3,000g for 1 min. The supernatant was removed carefully and 0.5 ml 0.8 M sucrose 
was added to re-suspend the pellet. The following solutions were added sequen- 
tially: 30 pl of 1M Tris-HCl (pH 8.0), 24 pl of 0.5mgml' lysosome, 6 pl of 
5mgml ! DNase I, and 6 ul of 0.125 M EDTA-NaOH (pH 8.0). The mixture 
was incubated for 7.5 min at room temperature and 100 ll of the stop solution 
(0.7 M sucrose, 20 mM MgCl, and 10 mM Tris-HCl, pH 8.0) was added to stop 
the digestion. The spheroplasts were aliquoted and stored at —20 °C. The frozen 
spheroplasts were thawed on ice and used in electrophysiology. 

Patch-clamp currents were recorded on inside-out patches pulled from the 
spheroplasts placed on clean glass coverslips. Electrodes were drawn from patch 
glass (TW100F-4, World Precision Instruments) and polished (MP-803, 
Narishige Co.) to a resistance of 7-10 MQ. Both bath solution and the pipette 
solution contained 200 mM KCl, 0.5 M sucrose and 10 mM HEPES at pH 7.4. All 
solutions of perfused nucleotides were prepared from sodium salts. No pressure 
was applied to the membrane after the GQ seal formation was achieved, which 
excluded the endogenous mechanosensitive channel activities’. The analogue 
signals were amplified by an Axon-200B patch-clamp amplifier (Molecular 
Devices Inc.), and filtered at 1 kHz using the built-in Bessel filter, digitized at 
100 pts by Digidata 1322a (Molecular Devices Inc.), and recorded to a computer 
hard disk. The data were analysed with pClampfit9 software. 


32. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta 
Crystallogr. D 60, 2126-2132 (2004). 

33. Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular 
crystallography. Acta Crystallogr. D 66, 12-21 (2010). 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature12055 


The structure of the KtrAB potassium 


transporter 
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In bacteria, archaea, fungi and plants the Trk, Ktr and HKT ion transporters are key components of osmotic regulation, 
pH homeostasis and resistance to drought and high salinity. These ion transporters are functionally diverse: they can 
function as Na* or K* channels and possibly as cation/K* symporters. They are closely related to potassium channels 
both at the level of the membrane protein and at the level of the cytosolic regulatory domains. Here we describe the 
crystal structure of a Ktr K* transporter, the KtrAB complex from Bacillus subtilis. The structure shows the dimeric 
membrane protein KtrB assembled with a cytosolic octameric KtrA ring bound to ATP, an activating ligand. A 
comparison between the structure of KtrAB-ATP and the structures of the isolated full-length KtrA protein with ATP 
or ADP reveals a ligand-dependent conformational change in the octameric ring, raising new ideas about the mechanism 


of activation in these transporters. 


The Trk/Ktr/HKT superfamily of ion transporters includes the Trk, 
Ktr and HKT transporter families’. These transporters are closely 
related to the superfamily of tetrameric cation channels**, which 
includes potassium, sodium and calcium channels. The membrane 
proteins in both superfamilies share the same architecture**”*: four 
subunits or repeats, each with the TM-P loop-TM (where TM indi- 
cates transmembrane helix) structural motif first seen in the KcsA 
potassium channel structure’’, assemble to form an ion pore with a 
‘selectivity filter’ along its central axis. In addition, the cytosolic 
regulatory proteins of the Trk and Ktr transporters and of the 
MthK""? and large-conductance Ca**-gated (BK) K* channels!?-!5 
are RCK (regulate conductance of K*) domains. 

Ktr ion transporters are crucial KT transport systems in some 
bacteria’®”’, with a role in resistance to osmotic stress and high salinity 
by mediating the early uptake of K* (refs 16, 18). The Ktr ion trans- 
porters are composed of two essential components*'®: a membrane 
protein (KtrB or KtrD) and a cytosolic regulatory protein (KtrA or 
KtrC). Studies with cells show that Ktr proteins*"’ transport K* but 
are also permeable to Na’, and that K* transport is ATP?’ and Na~ 
dependent**'*. Furthermore, it has been shown that truncated KtrA 
forms octameric rings*'” and that KtrB assembles as homodimers”. 
The structures described here clarify the molecular basis of some of 
these properties as well as the structural relationship between these 
transporters and ion channels. 


KtrAB architecture 


We have determined the structure of a KtrAB potassium transporter 
with diffraction data to 3.5 A (Fig. 1 and Supplementary Table 1). Two 
diffraction data sets to better than 4 A were collected out of many thou- 
sands of crystals tested. The structure was determined from an aniso- 
tropic diffraction data set which extends to 3.3 A in the best direction; 
four- and eight-fold averaging together with cross-crystal averaging led 
to the calculation of high-quality electron-density maps (Supplementary 
Fig. 1). The crystallographic asymmetric unit contains a KtrA octameric 
ring associated with two identical KtrB dimers, one on each face of the 
ring (Supplementary Fig. 2). This arrangement resulted from the in vitro 
assembly of independently expressed and purified protein components 
and the existence of two identical KtrB binding surfaces in KtrA. 


In the bacterial membrane the KtrAB complex is composed of a 
homodimeric KtrB membrane protein and a cytoplasmic octameric 
KtrA ring (Fig. 1a). The KtrA octameric ring (Fig. 1a, b and Supplemen- 
tary Fig. 3) is a four-fold symmetric assembly of four KtrA dimers with 
the typical RCK domain arrangement’*”*”*, the amino (N) lobe forming 
the ring and the smaller carboxy (C) lobe at the periphery. The ligand- 
binding site in the N lobe is occupied by an ATP molecule (Fig. 1a and 
Supplementary Figs 1b and 3b). The inner circumference of the ring 
delimits a ~30-A-wide hole which, together with gaps (approximate 
dimensions: 17 A by 9 A and 18 A by 11 A) between the surfaces of the 
KtrB homodimer and the KtrA ring, give access from the cytosol to the 
cytoplasmic pore of KtrB (Fig. 1b and Supplementary Fig. 3a). 

The interactions between KtrB and the KtrA ring are an important 
difference relative to KT channels!*"*. In channels the pore subunit 
and one or two RCK domains are encoded in the same polypeptide 
and the RCK octameric ring hangs from the cytoplasmic face of the 
membrane protein through a polypeptide tether. In the KtrAB com- 
plex, the KtrB homodimer sits directly on a face of the KtrA ring 
(Fig. 1c), close to a ring diagonal, and cytoplasmic loops establish 
two different types of contacts with pairs of RCK domains in the ring: 
the lateral contact and the tip contact (Fig. 1c, d). The lateral-contact 
regions involve a mid-section of the KtrB carboxy terminus, which 
runs along the homodimer interacting with the neighbouring KtrB 
subunit and the KtrA ring (Fig. 1c, dand Supplementary Fig. 4a). The 
tip-contact regions occur at the apexes of the cytoplasmic face of the 
KtrB homodimer (Fig. 1c, d and Supplementary Fig. 4b) where the 
KtrB cytoplasmic loops connecting repeats D1 and D2 interact with 
two opposing KtrA subunits. The functional relevance of these inter- 
actions and of our in vitro assembled KtrAB complex is demonstrated 
in the Supplementary Discussion. 


KtrB structure 


The KtrB subunit structure has four repeats (D1 to D4) wrapped around 
a central axis and cradling the selectivity filter (Figs 1c and 2a). Each 
repeat (Supplementary Figs 5a, b) includes the M1 helix spanning the 
bilayer, followed by the P-loop region (that includes the pore (P) helix 
and the selectivity-filter sequence) and a second shorter (M2a) helix, 
which ends close to the centre of the membrane. A C-terminal (M2b) 
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Figure 1 | Structure of the KtrAB K* transporter. a, Cartoon of the KtrAB 
structure with the extracellular side at the top. KtrB subunits are in orange and 
light blue whereas KtrA subunits are in dark blue and red; ATP is shown as 

yellow spheres and K* as magenta spheres; cell membrane is shown as a pale 
yellow rectangle. b, Surface representation of KtrAB rotated relative to panel 

aalong the vertical axis. One KtrA dimer has been removed for viewing the hole 
at centre of the octameric ring. N and C lobes of KtrA subunits are indicated. 


helix completes each repeat, running at a shallow angle towards the 
membrane cytoplasmic face. The selectivity filter is funnel-shaped with 
its wider mouth (11-13 A across) towards the extracellular face and the 
single ion-binding site forming the narrow end (Fig. 2a and 
Supplementary Fig. 5b). Immediately below the KtrB selectivity filter 
there is an intramembrane loop (Fig. 2a-c), a residue stretch that con- 
nects helices M2a to M2b in repeat D3 and is composed almost exclu- 
sively of glycines, alanines and serines (Supplementary Fig. 5c). The 
intramembrane loop together with a highly conserved arginine (R417) 
form a structure that blocks access between the selectivity filter and the 
cytoplasmic pore; mutations or truncations in this region increase ion 
flux and this region has been proposed to be a transporter gate””’. An 
open pathway runs from the R417/intramembrane loop to the cyto- 
plasmic face of the protein (Fig. 2a). The pathway is lined by a mix of 
polar and apolar residues and is ~4A wide at its narrowest point. 
Overall, this architecture resembles closely the architecture of pot- 
assium channel pores (Supplementary Fig. 6). 

The monomer and homodimer structures of KtrB and of the 
related TrkH’? (a Trk membrane protein) are very similar (Sup- 
plementary Fig. 7). A major difference between KtrB and TrkH is 
found in the C terminus. In KtrB the long C terminus runs from 
one subunit, forms a lateral contact and snakes into the cytoplasmic 
pore of the neighbouring KtrB, finishing just below the intramem- 
brane loop (Figs 1c, d and 2a, b). In contrast, the TrkH C terminus is 
much shorter and does not establish any contacts (Supplementary 
Figs 5c and 7a, b). The last seven residues of the KtrB C terminus 
(Supplementary Fig. 1c) line the wall of the cytoplasmic pore of the 
neighbouring subunit (Fig. 2a, b). The highly conserved C-terminal 
glycine (G445) is positioned with its carboxylate interacting with the 
side chain of a lysine, which is part of the intramembrane loop and is 
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c, Extracellular view of KtrB repeats (D1, red; D2, yellow; D3, orange; D4, blue); 
KtrA is shown as grey surface. Asterisks indicate green KtrB C termini running 
along the KtrB dimer. d, Detail of contact regions. Colouring as in panel 

c except for the D3-D4 loop, shown in purple. The disordered stretch in the 
D1-D2 loop is shown as a dotted line. KtrB helices and N and C termini are 
labelled. 


conserved in both the Ktr (K315 in KtrB) and Trk (K357 in TrkH) 
transporters (Fig. 2c). The C-terminal glycine does not exist in Trk 
proteins, but in the TrkH structure the side-chain carboxylate of a 
conserved glutamate (E470) occupies the same volume as the glycine 
carboxylate and also interacts with the lysine. 

To assess the importance of positioning a carboxylate close to the 
intramembrane loop we generated truncations of the C-terminal resi- 
dues and evaluated their impact in a growth complementation 
assay”°* of the TK2420 Escherichia coli strain. This strain has its major 
K* transport systems disabled and only grows in K* concentration 
230 mM; however, when expressing wild-type KtrAB it can grow in 
K* concentrations of ~0.3 mM. The position of the carboxylate does 
not seem to be essential as sequential truncation of up to three 
C-terminal residues has no effect or causes only a small shift in rescue 
(Fig. 2d). Furthermore, the stability of the KtrAB complex seems to be 
undisturbed (Fig. 2e). However, removal of four residues results in a 
non-functional transporter, equivalent to control plasmid (Fig. 2d). 
Consistently, KtrB without the last four residues, although it still 
forms a homodimer, does not assemble with KtrA (Fig. 2e). All muta- 
tions display similar expression levels (Supplementary Fig. 4f). 
Presumably, the fourth residue, a conserved apolar residue which in 
the structure seems to function as a C terminus anchor, has a role in 
the positioning of the C terminus and therefore on the stability of the 
C-terminus-mediated lateral contact. Notably, we had previously 
shown that truncation of the last 10 or 15 residues of the C terminus 
abolishes strain complementation and destabilizes the KtrB dimer’. 
Although we cannot rule out more subtle functional roles for the C 
terminus, such as cooperativity between KtrB subunits, our results 
demonstrate that the C terminus has a role both in the stability of the 
KtrB homodimer and of the KtrAB complex. 
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Figure 2 | KtrB subunit. a, KtrB subunit without repeat D2. K~ ion is 
indicated as a red sphere; the intramembrane loop, K315 (at the centre) and 
R417 (on the left) are shown in magenta. The ion pathway, calculated with 
Hollow" with a 1.4 A probe radius, is shown as a white surface. b, Close-up view 
of ion pathway, rotated 180° relative to panel a. C-terminal residues from the 
neighbouring KtrB subunit are shown in green stick representation. 

c, Intramembrane loops in KtrB (magenta and green) and TrkH (yellow), after 


Regulation of KtrAB activity 


We first determined that KtrA has a strong binding preference for 
ATP and ADP over other small molecules (Fig. 3a) resembling other 
KtrA proteins”. The functional effect of these two ligands was then 
evaluated in a *°Rb* uptake assay*”* by reconstituting KtrAB with 
ATP or ADP (Supplementary Fig. 8a) in the presence of an excess of 
KtrA to minimize the formation of the KtrB-KtrA-KtrB complex. 
KtrB alone was also reconstituted; it allows K* into cells but less 
efficiently than KtrAB*. 

All protein forms mediate flux in this assay (Fig. 3b). Importantly, 
flux is markedly higher with ATP, supporting previous cell-based 
results which showed that ATP is an activator of KtrAB. 
Noticeably, the 86Rb* flux difference between KtrAB-ATP and 
KtrB (~2-fold difference at 30 min) is smaller than expected from 
Kt uptake measurement in cells* (~10-fold difference in Vinax), 
which may indicate that, in addition to ATP, other activating factors 
are necessary for full transporter activity. 

To understand how the two ligands affect KtrA we determined the 
crystal structures of this protein bound to ATP and ADP at ~3.2 A 
and ~2.9 A, respectively (Fig. 4a, b and Supplementary Table 1). The 
ADP- and ATP-bound octameric rings adopt different conforma- 
tions: the KtrA-ATP ring adopts a four-fold symmetric square con- 
formation and the KtrA-ADP ring adopts a two-fold symmetric 
diamond conformation. A conformational difference is also observed 
in solution, through controlled proteolysis (Supplementary Fig. 8b). 
Both structures present clear electron density for the ligand’s aden- 
osine and phosphate groups (Supplementary Fig. 1d, e). Although the 
conformation of the adenosine group and its interactions with the 
protein are very similar in the two structures, the phosphate groups 


++ ++ Control plasmid 
owt KtrA+KtrB(A4) 
om KtrA+KtrB(A3) 
‘A2) 
Ow KtrA+KtrB(A1) 


ARTICLE 


© 
Ktrg 5 
<=, 


— KtrA+KtrB 
—— KtrA+KtrB(A4) 


— KtrA+KtrB(A3) 


Absorbance units (280 nm) 


8 10 12 14 
Elution volume (ml) 


superposition of pore regions. d, Optical density of E. coli TK2420 (expressing 
different constructs) cultures versus K* concentration. Al, A2, A3 and A4 
indicate KtrB C-terminal sequential truncations. Mean + s.d. from three 
independent experiments is shown. e, Size-exclusion chromatography elution 
profiles of mixtures of wild-type KtrA and wild-type or mutant KtrB, at same 
concentrations and ratios. Elution volumes for KtrAB, KtrA (octamer) and 
KtrB (monomer and dimer) are indicated. 


are rearranged (Fig. 4c); in particular, the y-phosphate in the ATP 
structure occupies the position taken by the B-phosphate in the ADP 
structure. This is accomplished by a shift in G79 and a repositioning of 
the o- and B-phosphates relative to the ADP structure. The phosphate- 
protein interactions are also different: in the ADP structure the 
B-phosphate interacts with residues from its own binding site, in par- 
ticular with the side chains R16 and K103; in contrast, in the ATP 
structure the y-phosphate interacts with R16 from its binding site, 
whereas the B-phosphate interacts with R16 from the neighbouring 
KtrA subunit binding site (Fig. 4c). Presumably owing to the intradi- 
mer interactions mediated by ATP, the two binding sites within a 
KtrA dimer are brought together through a ~16° change in the angle 
between subunits (Fig. 4d); the distances separating the conserved D36 
in the two binding pockets are ~35 A in the ADP structure and ~30 A 
in the ATP structure. This dimer rearrangement is accompanied by 
rigid-body rotations across two of the four dimer-to-dimer interfaces 
in the ring and is reflected in the different ring conformations. Most 
notably the distance separating F71 (a KtrA residue that participates 
in the KtrAB tip contact region) in opposite ring subunits is 68.7 A in 
the KtrA-ATP ring, whereas in KtrA-ADP the equivalent distances 
are 68.3 and 81.8 A (Fig. 4a, b). Similarly, the distance separating L66 (a 
KtrA residue that participates in the KtrAB lateral contact region) in 
KtrA-ATP is 49.8 A, whereas the equivalent distances in KtrA~-ADP 
are 49.8 and 66.1 A. Therefore, the conformational change from the 
KtrA-ADP ring to the ATP ring involves the asymmetric contraction 
of the ring, with one diagonal direction changing by 13-16 A, whereas 
the other diagonal is unaffected. 

Notably, the conformation of the octameric ring and the details of 
ligand binding are basically identical between the isolated KtrA-~ATP 
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Figure 3 | Ligand regulation of KtrAB transporter. a, SDS-PAGE of KtrA 
protein eluted from ADP-agarose with 1 mM of different adenosine-containing 
ligands. Purified KtrA is indicated as Ctrl. b, Time course of normalized ®oRb 
uptake into liposomes with reconstituted KtrAB transporter (with ATP or 
ADP) or KtrB alone (with ATP). Control measurement was with liposomes 
formed in the presence of KtrA alone; normalization after addition of 
valinomycin at end of assay. Mean = s.e.m. calculated from six assays 
corresponding to triplicates from two separate liposome preparations. c.p.m., 
counts per minute. 


structure and the KtrA structure that forms the KtrAB-ATP complex 
(Supplementary Fig. 3a). This suggests that unlike truncated KtrA 
(composed by N lobe alone; Supplementary Fig. 9) where ligand 
binding and ring conformation are uncoupled”’, in the full-length 
KtrA the ATP molecule induces the formation of a unique set of 
intradimer interactions which determine the conformation of the 
octameric ring. The absence of the y-phosphate in ADP is reflected 
in the lack of ligand-mediated intra-dimer interactions and in the 
relaxation of the ring. 


Implications for KtrAB activation 

We have uncovered several differences between KtrAB and the BK’*"* 
and MthK'*** channels which have an impact on their mecha- 
nisms of transport activation. In channels, the RCK rings are covalently 
tethered to the C terminus of a single channel pore domain and the 
rings expand symmetrically upon binding of an activating ligand. In 
KtrAB, the cytoplasmic loops of the KtrB homodimer mediate the 
interaction with the KtrA (RCK) ring and the ring contracts asymmet- 
rically upon binding of ATP. 

To propose an activation mechanism for KtrAB we need to define 
its ‘gates’ (that is, regions that undergo a conformational change and 
close/open the ion pathway)—ion transporters” and channels** can 
have multiple gates. In KtrAB, the intramembrane loop has been 
proposed to be a gate”’’. A comparison between KtrB and TrkH 
structures (Supplementary Fig. 7a—d) raises the possibility of another 
gate region; it shows the M2b helix/D1-D2 loop region adopting 
different conformations. In TrkH, helices M2a and M2b (in repeat 
D1) are continuous and the ordered amino acid stretch of the D1-D2 
loop is positioned so that it partially obstructs the cytoplasmic ion 
pathway of KtrB. This indicates that this region is able to adopt 
different positions and could function as a gate. 
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Figure 4 | Conformational changes in the KtrA octameric ring. a, b, Surface 
representations of isolated KtrA-ATP ring (a) and KtrA—ADP ring (b). KtrA 
subunits are shown in red and blue. F71 Ca is shown as yellow dots; L66 Cu is 
shown as white dots. Dimer-to-dimer interfaces that undergo a rigid-body 
rotation between ATP- and ADP-bound states are indicated as rotating; the 
other dimer-to-dimer interfaces are indicated as fixed. c, Stereo view of 
superposed ligand-binding sites from KtrA—ADP (light blue Co trace and side- 
chain stick) and KtrA-ATP (dark blue Cx trace and white side-chain stick). 
Residues, ATP and ADP molecules and ligand phosphates are labelled. Some 
ligand-protein interactions are shown as dashed lines. d, Superposition of a 
KtrA-ATP (dark blue) and KtrA-ADP (light blue) dimer through one of the 
subunits; only N lobes are shown. Ligands are shown as stick representation. 
The distance separating D36 residues (red spheres) in dimer subunits and 
dimer hinge angle change are indicated. 


We foresee therefore two possible activation mechanisms for 
KtrAB (see also Supplementary Discussion’’**). For model 1, the 
asymmetric KtrA contraction/expansion occurs along the diagonal 
defined by the lateral contacts (Fig. 5a, b). The direct connection 
between the KtrB C terminus, involved in the lateral-contact region, 
and the R417/intramembrane loop gate suggests that this would be 
the KtrA controlled gate. There are two pieces of evidence against this 
model: first, the R417/intramembrane gate seems to be closed in the 
KtrAB structure bound to ATP, an activating ligand. Second, sequen- 
tial truncations of 1-3 residues in the C terminus have no apparent 
functional consequence. For model 2, the asymmetric change in KtrA 
occurs along the diagonal defined by the tip contacts (Fig. 5a, c). We 
see no obvious direct structural link between the tip-contact region, 
formed by the D1-D2 loop in KtrB, and the R417/intramembrane 
gate (Fig. 2a). A clear alternative is that the D1-D2 loop is the KtrA 
controlled gate, whereas the R417/intramembrane gate is activated by 
an as-yet-unknown stimulus. Upon ATP binding, the D1-D2 loop 
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Figure 5 | Models for KtrAB activation. Cartoon with the KtrB homodimer 
in orange and grey; KtrA subunits as dark- and light-blue ovals; intramembrane 
loop in magenta; C terminus as green coil; and K* as green sphere. 

a, Extracellular view with asymmetric conformational changes in the KtrA ring 
occurring along two possible directions: along lateral contact (pair of white 
arrows) or along tip contact (grey arrows). b, c, Side views of KtrAB. For model 
1 (b), KtrA asymmetrical conformational change along lateral contact is 
propagated through the KtrB C terminus to intramembrane loop gate 
(indicated by wavy arrow). For model 2 (c), KtrA asymmetrical conformational 
change along tip contact alters conformation of the KtrB D1-D2 loop gate 
(wavy arrows), opening a gate. For full activation the intramembrane loop gate 
has to open upon an as-yet-unknown stimulus (cartoon representation in dark 


grey). 


would adopt the conformation seen in the KtrAB structure, opening 
the cytoplasmic pore and resulting in an increase in transporter acti- 
vity, as detected by our functional assay. Full transporter activity, as 
seen in cells*, would require not just ATP but also the influence of 
another stimulus to activate the intramembrane gate. 

In summary, this study provides novel insights and raises new ideas 
about the structure and mechanism of regulation of the KtrAB K* 
transporter, offering a better understanding of the function of the Trk/ 
Ktr/HKT transporters in bacteria, archaea, fungi and plants. 


METHODS SUMMARY 


Briefly, B. subtilis KtrB was overexpressed in the BL21(DE3) E. coli strain, protein 
extracted with dodecylmaltoside, affinity purified in cobalt matrix and further 
purified by size-exclusion chromatography. Tag-less B. subtilis KtrA protein was 
overexpressed in BL21(DE3), and protein was purified by affinity chromatography 
in an ADP-agarose matrix. KtrAB complex was assembled by mixing KtrB and 
KtrA. Preparation of proteoliposomes and *°Rb* flux assay was performed as 
previously described with some variations®”’. For crystallization the complex 
was further purified and detergent exchanged to Cymal6 by size-exclusion chro- 
matography. Structure determination and representation made use of XDS”, 
MOSFLM”', PHYRE”, PHASER®, PHENIX*™, RAVE”, COOT”’, Diffraction 
Anisotropy Server’, PYMOL*, STRAP”, ESPrit*®, Hollow” and CCP4” pro- 
grams. Growth complementation assay followed protocols previously described®*. 
Protein quality control was performed by mass spectrometry. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Protein expression and purification. N-terminal His-tagged KtrB was overex- 
pressed in E. coli BL21 (DE3), in LB media at 37 °C for 2.5 h after IPTG induction. 
After cell lysis in Buffer A (50 mM Tris-HCl pH 8.0, 120 mM NaCl, 30 mM KCl) 
supplemented with protease inhibitors, the lysate was supplemented with 40 mM 
DDM (n-dodecyl-f-b-maltoside, solgrade from Anatrace) for membrane protein 
extraction overnight at 4 °C. Spin-cleared lysate was loaded onto a Talon metal 
affinity resin (Clontech) and washed with Buffer B (50mM Tris-HCl pH 8.0, 
120mM NaCl, 30mM KCl, 1mM DDM). Protein was eluted with Buffer B 
supplemented with 150mM imidazole. Eluted fractions were concentrated to 
~2.5mgml * and incubated overnight at 4°C with thrombin for tag cleavage. 
At this point the protein was either directly used for KtrAB complex assembly or 
further purified by size exclusion with a Superdex-200 column. 

Tag-less KtrA was overexpressed in E. coli BL21 (DE3), in LB media at 20 °C for 
14-16 h after IPTG induction. Cells lysis was done in Buffer C (50 mM Tris-HCl 
pH7.5, 50mM KCl, 5mM DTT) and cleared lysate loaded into an anion 
exchange column. Fractions containing KtrA were incubated with ADP-agarose 
resin (Innova Biosciences) overnight at 4 °C. Beads were washed thoroughly with 
Buffer D (50 mM Tris-HCl, 150 mM KCl, 1 mM TCEP) and protein was eluted in 
the same buffer supplemented with 5 mM of adenosine-containing nucleotide. At 
this point the protein was concentrated to ~3 mg ml! and either directly used 
for KtrAB complex assembly or further purified by size exclusion with a 
Superdex-S200 column. 

KtrAB complex was assembled by mixing individually purified KtrB and KtrA 
batches, at a pre-defined ratio. The complex mixture was further concentrated 
(~20 mg ml‘) and purified in a size-exclusion Superdex-200 column with Buffer E 
(20mM Tris-HCl pH8.0, 120mM NaCl, 30mM KCl, 5mM DTT, 1.5mM 
6-cyclohexyl-1-hexyl-B-p-maltoside (cymal-6, anagrade from Anatrace)). 
Crystallization. KtrAB complex purified in Buffer E supplemented with 1mM 
ATP (ATP) was concentrated to ~10mgml ' and dialysed overnight against the 
same buffer before crystallization assays. KtrAB complex crystals were obtained by 
the sitting-drop vapour diffusion method, mixing 1 yl of protein with 1 pl of crys- 
tallization solution. The best diffracting crystals grew in 100 mM N-(2-acetamido)- 
iminodiacetic acid (ADA) pH 6.5, 20% polyethylene glycol 400 (PEG 400) and 
200 mM ammonium sulphate at 20 °C. Crystals were flash-frozen in liquid nitrogen 
after addition of mother liquor with 45% PEG 400 directly to the crystal drop. 

Before crystallization KtrA-ATP and KtrA-ADP were run in a size-exclusion 

chromatography column equilibrated with Buffer F (50mM Tris-HCl pH7.5, 
150 mM KCl, 5mM DTT, 1mM ATP or ADP). KtrA—ATP crystals were grown 
by sitting-drop vapour diffusion with 100 mM HEPES pH7.5, 3% polyethylene 
glycol 6000 (PEG 6000) and 2.5% 2-methyl-2,4-pentanediol (MPD) at 20°C. 
Crystals were flash frozen in liquid nitrogen after being transferred to identical 
mother liquor containing 6% PEG 6000 and 30% glycerol. KtrA—ADP crystals 
were obtained with 100 mM citrate pH 5.6, 14.5% PEG 2000 and 200 mM ammo- 
nium sulphate at 20 °C. Crystals were flash-frozen in liquid nitrogen after addi- 
tion of modified mother liquor, with 15% PEG 2000 and 20% glycerol, directly to 
the crystal drop. 
Data processing and structure determination. Diffraction data were processed 
with XDS*° or MOSFLM”, followed by scaling with CCP4 package”. Data from 
KtrAB crystals is highly anisotropic and was analysed, truncated and sharpened 
in the Diffraction Anisotropy Server®’ (http://services.mbi.ucla.edu/anisoscale/). 
Recommended resolution limits after anisotropic correction were 3.8 A, 4.1 Aand 
3.3 A, respectively, along the a*, b* and c* reciprocal space directions. 

KtrA-ATP structure was determined by molecular replacement with 
PHASER® using as search models the truncated KtrA N-lobe structure 
(Protein Data Bank code 2HMU) and C-lobe homology models generated by 
PHYRE”. Improvement of electron-density maps, in particular on the C-lobe 
region, was achieved by cross-crystal averaging between KtrA-ATP and trun- 
cated KtrA using MAMA®*® and DMmulti (CCP4 package). The structure was 
finalized with cycles of model building with COOT* and refinement with 
PHENIX** and REFMAC (CCP4 package). The two subunits in the asymmetric 
unit include residues 6 to 222. Ramachandran plot for KtrA-ATP structure 
contains: 0% outliers, 1.63% allowed, 98.37% preferred. 

The N and C lobes of KtrA-ATP structure were used for molecular replacement 
with the KtrA-ADP data. Cycles of averaging between the four subunits of KtrA- 
ADP and the two subunits of KtrA-~ATP improved maps. Model building and 
refinement produced a final model of KtrA-~ADP that includes: in chain A, resi- 
dues 8-157, 162-174 (built as alanines), 177-222; in chain B, residues 8-222; in 
chain C, residues 8-147, 152-157, 161-176 (built as alanines), 177-222; in chain D, 
residues 8-187, 188-192 (built as alanines), 196-222. Ramachandran plot for 
KtrA-ADP structure contains: 0% outliers, 1.82% allowed, 98.18% preferred. 

The KtrAB structure was determined by molecular replacement using as search 
models a poly-alanine version of the TrkH dimer structure (Protein Data Bank 
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code 3PJZ) and the KtrA-ATP octameric ring structure. Cycles of four-fold 
averaging for the KtrB subunits and eight-fold averaging for the KtrA ring sub- 
units together with cross-crystal averaging with the KtrA-ATP structure resulted 
in great improvements in the electron-density maps and allowed tracing of the 
KtrB chain. Chain trace was confirmed with a selenomethionine anomalous map 
calculated to 7.4 A, which at a 3.60 contour revealed the position of 9 methionines 
out of 14 possible (Supplementary Fig. 10), and through a comparison with the 
TrkH structure. Structure was finalized through cycles of model building and 
refinement. The model was refined against data to 3.5 A as there was no visible 
improvement in refinement statistics or map quality with data to the recom- 
mended limit (see above). In the KtrAB structure the KtrB subunits span residues 
15 to 445 (except for a missing stretch formed by residues 103 and 104) whereas 
the KtrA subunits span residues 7-222. Ramachandran plot for KtrAB crystals: 
0% outliers, 5.94% allowed, 94.06% preferred. Model analysis was performed with 
PYMOL*, STRAP*, ESPrit*® and Hollow’. 

Complementation assay. The complementation assay with TK2420 follows pro- 
cedures previously described***. Briefly, mutants and wild-type KtrB and KtrA 
were cloned into a dicistronic constitutive expression plasmid. The TK2420 strain 
was transformed with the different plasmids and plated on LBK (Luria-Bertani 
broth where NaCl is replaced by KCl) agar plates. Single colonies were picked and 
grown in liquid LBK overnight at 37 °C. A 2-ml aliquot of cells was gently pelleted 
and re-suspended in 400 pl of minimal media containing 0.1 mM K*. A 15pl 
aliquot of re-suspended cells was then used to inoculate 5 ml of minimal media 
prepared with different K* concentrations (0.1, 0.3, 1, 2, 6, 10, 30 or 115 mM) and 
grown for 15h at 37 °C at which point optical density at 595 nm was measured. 
Preparation of proteoliposomes. Preparation follows previously described 
methods*”’. Polar E. coli lipids in chloroform were dried under a stream of argon, 
washed in pentane and dried again. Residual solvent was removed under vacuum. 
Lipids were re-suspended at 10 mg ml’ in swelling buffer (150 mM KCl, 10 mM 
HEPES, 5 mM N-methylglucamine, pH 7.6) using a bath sonicator. Lipids were 
solubilized by adding decylmaltoside (powder) to 40mM. After 2h of gentle 
agitation, dodecylmaltoside-solubilized protein, in KtrB buffer (150 mM KCI, 
20 mM Tris-HCl, 5 mM DTT, 0.5 mM dodecylmaltoside, 200 uM NTP, pH 
8.0), was added at 1:100 w:w protein-to-lipid ratio and incubated for 30 min at 
room temperature. KtrAB complex was assembled just before reconstitution with 
excess of either ATP-bound KtrA or ADP-bound KtrA. The amount of protein 
added to 100 kl solubilized lipids was adjusted to contain 10 j1g KtrB both when 
reconstituted alone or as KtrAB complex. Detergent was removed by dialysis 
against swelling buffer supplemented with 20 1M ATP or ADP and with Na* 
concentration at 40 1M, using 15kDa cutoff dialysis membranes for 4 days at 
4 °C. Control liposomes were prepared similarly, except that only KtrA was added 
to the solubilized lipids. 

S°Rb* flux assay. Assay as previously described®”,with small modifications. A 
gradient of K~ across the liposome membrane was generated, immediately before 
the assay, by loading 100 1 aliquot of liposomes into a 5 ml disposable spin column 
containing Sephadex G50 (fine) pre-swollen with sorbitol buffer (20 uM KCl, 
150 mM sorbitol, 10 mM HEPES, 5 mM N-methylglucamine, pH 7.6) and centrifu- 
ging for 1 min at 700g. After forming the gradient, liposomes were supplemented 
with 20 .M ADP or ATP (final concentration of Na‘ was set to 40 1M) and mixed 
with three volume of °°Rb* assay buffer (Rb at ~2,000 counts wt in sorbitol 
buffer). At different time points 100 jl of uptake reaction were loaded on a 1.5-ml 
bed volume Dowex cation exchange columns (pre-washed in 150mM sorbitol 
solution plus 5mgml' BSA to avoid unspecific binding to resin, and then pre- 
equilibrated in 150 mM sorbitol solution) and eluted with 6% sorbitol solution. The 
eluate was collected in 4-ml scintillation vials and mixed with 1.5 ml Optiphase 
scintillation fluid. At the end of the experiment 900 nM valinomycin was added to 
the last 100 1] aliquot of the uptake reaction, incubated for 3 min and then loaded on 
Dowex columns as described above. The uptake of *°Rb* into liposomes was 
expressed as the percentage of the counts measured after valinomycin treatment 
(this provides a correction for the internal volume of liposomes in the reaction 
sample). Valinomycin was also applied at earlier time points to check whether the 
long timescale of the experiments could affect the final counts, but the values were 
similar to those measured after 2 h. For all conditions means + s.e.m. of triplicates 
were calculated for two separate liposome preparations. 

KtrA ligand binding. Potential ligands for KtrA were screened by making use of 
the tight binding of isolated KtrA to an ADP-agarose matrix and by evaluating 
which small molecules elute the protein from the beads. Purified KtrA was incu- 
bated with the beads at 4 °C. Beads were washed with buffer and then incubated 
for 30 min with 1 mM of different ligands: ATP (adenosine triphosphate), ADP 
(adenosine diphosphate), AMP (adenoside monophosphate), FAD (flavin mono- 
nucleotide), NAD (nicotinamide adenine dinucleotide), NADH (reduced NAD), 
NADPH (reduced nicotinamide adenine dinucleotide phosphate) and GTP (guano- 
sine triphosphate). Eluted fractions were run in an SDS-PAGE. 
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Massive present-day early-type (elliptical and lenticular) galaxies 
probably gained the bulk of their stellar mass and heavy elements 
through intense, dust-enshrouded starbursts—that is, increased 
rates of star formation—in the most massive dark-matter haloes 
at early epochs. However, it remains unknown how soon after the 
Big Bang massive starburst progenitors exist. The measured red- 
shift (z) distribution of dusty, massive starbursts has long been 
suspected to be biased low in z owing to selection effects’, as con- 
firmed by recent findings of systems with redshifts as high as ~5 
(refs 2-4). Here we report the identification of a massive starburst 
galaxy at z= 6.34 through a submillimetre colour-selection tech- 
nique. We unambiguously determined the redshift from a suite of 
molecular and atomic fine-structure cooling lines. These measure- 
ments reveal a hundred billion solar masses of highly excited, 
chemically evolved interstellar medium in this galaxy, which con- 
stitutes at least 40 per cent of the baryonic mass. A ‘maximum 
starburst’ converts the gas into stars at a rate more than 2,000 times 
that of the Milky Way, a rate among the highest observed at any 
epoch. Despite the overall downturn in cosmic star formation 
towards the highest redshifts’, it seems that environments mature 
enough to form the most massive, intense starbursts existed at least 
as early as 880 million years after the Big Bang. 

We have searched 21 deg” of the Herschel/SPIRE data of the 
HerMES blank field survey® at wavelengths 250-500 tm for ‘ultra- 
red’ sources with flux densities So5om <S350um < Ssooum and Ssooum/ 
S3soum > 1.3, that is, galaxies that are significantly redder (and thus, 
potentially at higher redshift) than massive starbursts discovered thus 
far. This selection yields five candidate ultra-red sources down to a flux 
limit of 30 mJy at 500 tm (>5o and above the confusion noise; see 
Supplementary Information section 1 for additional details), corres- 
ponding to a source density of <0.24 deg *. For comparison, models 


of number counts in the Herschel/SPIRE bands suggest a space 
density of massive starburst galaxies at z>6 with Ssooum > 30 mJy 
of 0.014 deg * (ref. 7). 

To understand the nature of galaxies selected by this technique, we 
have obtained full frequency scans of the 3-mm and 1-mm bands 
towards HEFLS 3 (also known as [HERMES $350 J170647.8+584623; 
Ss500um/S350um = 1.45), the brightest candidate discovered in our study. 
These observations, augmented by selected follow-up over a broader 
wavelength range, unambiguously determine the galaxy redshift to be 
Zz = 6.3369 + 0.0009 based on a suite of 7 CO lines, 7 H,O lines, and 
OH, OH*, H,O*, NH3, [C1] and [Cu] lines detected in emission and 
absorption (Fig. 1). At this redshift, the Universe was just 880 million 
years old (or one-sixteenth of its present age), and 1” on the sky corre- 
sponds to a physical scale of 5.6 kpc. Further observations from optical 
to radio wavelengths reveal strong continuum emission over virtually 
the entire wavelength range between 2.2 um and 20cm, with no 
detected emission shortward of 1 tum (see Supplementary Informa- 
tion section 2 and Supplementary Figs 1-11 for additional details). 

HELS 3 hosts an intense starburst. The 870-pm flux of HFLS 3 is 
>3.5 times higher than those of the brightest high-redshift starbursts 
in a 0.25-deg’ region containing the Hubble Ultra Deep Field (HUDF)*. 
From the continuum spectral energy distribution (Fig. 2), we find that 
the far-infrared (FIR) luminosity Lyyz and inferred star formation rate 
(SER) of 2,900 Mgun Yr of HELS 3 (where M,,, is the solar mass) are 15- 
20 times those of the prototypical local ultra-luminous starburst Arp 220, 
and >2,000 times those of the Milky Way (Table 1 and Supplementary 
Information section 3). The SFR of HFLS 3 alone corresponds to ~4.5 
times the ultraviolet-based SFR of all z = 5.5-6.5 star-forming galaxies in 
the HUDF combined”, but the rarity and dust obscuration of ultra-red 
sources like HFLS 3 implies that they do not dominate the ultraviolet 
photon density needed to reionize the Universe’®. 
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Figure 1 | Redshift identification through molecular and atomic 
spectroscopy of HFLS 3. a, Black trace, wide-band spectroscopy in the 
observed-frame 19-0.95-mm (histogram; rest-frame 2,600-130 jim) 
wavelength range with CARMA (3 mm; ‘blind’ frequency scan of the full band), 
the PdBI (2 mm), the JVLA (19-6 mm) and CSO/Z-spec (1 mm; instantaneous 
coverage). (CARMA, Combined Array for Research in Millimeter-wave 
Astronomy; PdBI, Plateau de Bure Interferometer; JVLA, Jansky Very Large 
Array; and CSO, Caltech Submillimeter Observatory.) This uniquely 
determines the redshift of HFLS 3 to be z = 6.3369 based on the detection of a 
series of HO, CO, OH, OH*, NH3, [C1] and [Cm] emission and absorption 
lines. b-o, Detailed profiles of detected lines (histograms; rest frequencies are 
indicated by corresponding letters in a). 1-mm lines (m-o) are deeper, 
interferometric confirmation observations for NH3, OH (both PdBI) and [C1] 


HELS 3 is a massive, gas-rich galaxy. From the spectral energy distri- 
bution and the intensity of the CO and [C1] emission, we find a dust 
mass of Mj = 1.3 X 10° M,,, and total molecular and atomic gas masses 
of respectively M,,, = 1.0 X 10'! Moun and Myy, = 2.0 X 10'° M,,,n. These 
masses are 15-20 times those of Arp 220, and correspond to a gas-to- 
dust ratio of ~80 and a gas depletion timescale of M,,./SFR ~ 36 Myr. 
These values are comparable to lower-redshift submillimetre-selected 
starbursts’*’*. From the [C 1] luminosity, we find an atomic carbon mass 
of 4.5 X 10” Mgun. At the current SFR of HELS 3, this level of carbon 
enrichment could have been achieved through supernovae on a timescale 
of ~10’ yr (ref. 13). The profiles of the molecular and atomic emission 
lines typically show two velocity components (Fig. 1 and Supplementary 
Figs 5 and 7). The gas is distributed over a region of 1.7 kpc radius with a 
high velocity gradient and dispersion (Fig. 3). This suggests a dispersion- 
dominated galaxy with a dynamical mass of Mayn = 2.7 X 10'! Mgun- The 
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(CARMA) not shown in a. The line profiles are typically asymmetric relative to 
single Gaussian fits, indicating the presence of two principal velocity 
components at redshifts of 6.3335 and 6.3427. The implied CO, [C1] and [C11] 
line luminosities are respectively (5.08 + 0.45) X 10° La:.5 

(3.0 + 1.9) X 108 Lun and (1.55 + 0.32) X 10"? Leun- Strong rest-frame 
submillimetre to FIR continuum emission is detected over virtually the entire 
wavelength range. For comparison, the Herschel/SPIRE spectrum of the nearby 
ultra-luminous infrared galaxy Arp 220° is overplotted in grey (a). Lines 
labelled in italic are tentative detections or upper limits (see Supplementary 
Table 2). Most of the bright spectral features detected in Arp 220°" are also 
detected in HFLS 3 (in spectral regions not blocked by the terrestrial 
atmosphere). See Supplementary Information sections 2-4 for more details. 


gas mass fraction in galaxies is a measure of the relative depletion and 
replenishment of molecular gas, and is expected to be a function of halo 
mass and redshift from simulations’. In HFLS 3, we find a high gas mass 
fraction Of fgas = Mgas/Mayn ~ 40%, comparable to what is found in sub- 
millimetre-selected starbursts and massive star-forming galaxies at z ~ 2 
(refs 15, 16), but ~3 times higher than in nearby ultra-luminous infrared 
galaxies (ULIRGs) like Arp 220, and >30 times higher than in the Milky 
Way. From population synthesis modelling, we find a stellar mass of 
Mx = 3.7 X 10'° Mgun, comparable to that of Arp 220 and about half that 
of the Milky Way. This suggests that at most ~40% of Mayn within the 
radius of the gas reservoir is due to dark matter. With up to ~ 101! Mean of 
dark matter within 3.4 kpc, HFLS 3 is likely to reside in a dark-matter halo 
massive enough to grow a present-day galaxy cluster'’. The efficiency 
of star formation is given by & = tayn X SFR/Mgas, where tayn = (r/ 
(2GM))"”? is the dynamical (or free-fall) time, r is the source radius, 
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Figure 2 | Spectral energy distribution and Herschel/SPIRE colours of 
HELS 3. a, HFLS 3 was identified as a very high redshift candidate, as it appears 
red between the Herschel/SPIRE 250-, 350- and 500-um bands (inset). The 
spectral energy distribution of the source (data points; ps, observed-frame 
wavelength; V,.s, rest-frame frequency; AB mag, magnitudes in the AB system; 
error bars are lo r.m.s. uncertainties in both panels) is fitted with a modified 
black body (MBB; solid line) and spectral templates for the starburst galaxies 
Arp 220, M 82, HR 10 and the Eyelash (broken lines, see key). The implied FIR 
luminosity is 2.86193? 10° Leun. The dust in HELS 3 is not optically thick at 
wavelengths longward of rest-frame 162.7 jim (95.4% confidence; 
Supplementary Fig. 12). This is in contrast to Arp 220, in which the dust 
becomes optically thick (that is, tg = 1) shortward of 234 + 3 ym (ref. 20). 
Other high-redshift massive starburst galaxies (including the Eyelash) typically 
become optically thick around ~200 um. This suggests that none of the 


M is the mass within radius r and G is the gravitational constant. For 
r=1.7kpc and M= Mgass this suggests ¢ = 0.06, which is a few times 
higher than found in nearby starbursts and in giant molecular cloud cores 
in the Galaxy”*. 

The properties of atomic and molecular gas in HFLS 3 are fully 
consistent with a highly enriched, highly excited interstellar medium, 


Table 1 | Observed and derived quantities for HFLS 3, Arp 220 and 
the Milky Way 


HFLS 3 Arp 220* Milky Way* 
Zz 6.3369 0.0181 
Megas (Msun)t (1.04 + 0.09) x 101! 5.2 x 109 2.5 x 10° 
Maust Msun)i 1.314232 x 109 ~1 x 10° ~6 x 10” 
Mx (Msun)8 ~3.7 x 101° ~(3-5) x 10!° ~6.4 x 10!° 
Mayn (Mgun) !I 2.7 x10! 3.45 x 101° 2 x 101! (<20 kpc) 
fas (%) 4 40 15 1:2 
Leip (Leun)# 2.861932 x 1013 1.8 x 1074 1.1 x 107° 
SFR (Msun yt 1)s 2,900 ~180 nes 
Taust (K)** 55.9793) 66 ~19 


For details see Supplementary Information section 3. 

* Literature values for Arp 220 and the Milky Way are adopted from refs 20 and 27-30. The total 
molecular gas mass of the Milky Way is uncertain by at least a factor of 2. Quoted dust masses and stellar 
masses are typically uncertain by factors of 2-3 owing to systematics. The dynamical mass for the Milky 
Way is quoted within the inner 20 kpc to be comparable to the other systems, not probing the outer 
regions dominated by dark matter. The dust temperature in the Milky Way varies by at least +5 K 
around the quoted value, which is used as a representative value. Both Arp 220 and the Milky Way are 
known to contain small fractions of significantly warmer dust. All errors are 1o r.m.s. uncertainties. 

+ Molecular gas mass, derived assuming aco = Mgas/L'co = 1 Msun (Kkms~! pc?) ? (see 
Supplementary Information section 3.3). 

¢ Dust mass, derived from spectral energy distribution fitting (see Supplementary Information section 
3.1). 

§ Stellar mass, derived from population synthesis fitting (see Supplementary Information section 3.4). 
\IDynamical mass (see Supplementary Information section 3.5). 

“|Gas mass fraction, derived assuming fgas = Mgas/Mayn (see Supplementary Information section 3.6). 
#FIR luminosity as determined over the range of 42.5-122.5 xm from spectral energy distribution 
fitting (see Supplementary Information section 3.1). 

oSFR, derived assuming SFR (in Mcun yr +) = 1.0 X 107 2° Lrig (in Leun) (See Supplementary 
Information section 3.2). 

** Dust temperature, derived from spectral energy distribution fitting (see Supplementary Information 
section 3.1). 
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detected molecular/fine-structure emission lines in HFLS 3 require correction 
for extinction. The radio continuum luminosity of HFLS 3 is consistent with the 
radio-FIR correlation for nearby star-forming galaxies. b, Flux density ratios 
(350 ptm/250 um and 500 pm/350 pm) of HFLS 3. The coloured lines are the 
same templates as in a, but redshifted between 1 < z< 8 (number labels 
indicate redshifts). Dashed grey lines indicate the dividing lines for red 
(S25oum < S3soum < Soom) and ultra-red (S25om <S3s5oum and 

1.3 X S350m < Ssooum) Sources. Grey symbols show the positions of five 
spectroscopically confirmed red sources at 4< z< 5.5 (including three new 
sources from our study), which all fall outside the ultra-red cut-off. This shows 
that ultra-red sources will lie at z > 6 for typical shapes of the spectral energy 
distribution (except those with low dust temperatures), whereas red sources 
typically are at z< 5.5. See Supplementary Information sections 1 and 3 for 
more details. 


as typically found in the nuclei of warm, intense starbursts, but dis- 
tributed over a large, ~3.5-kpc-diameter, region. The observed CO 
and [C11] luminosities suggest that dust is the primary coolant of the 
gas if both are thermally coupled. The Ljcuj/Lrir ratio of ~5 X 10° * is 
typical for high radiation environments in extreme starbursts and 
active galactic nucleus (AGN) host galaxies’’. The Lycuy/Lcoc-o) ratio 
of ~3,000 suggests that the bulk of the line emission is associated with 
the photon-dominated regions of a massive starburst. At the Lr of 
HELS 3, this suggests an infrared radiation field strength and gas 
density comparable to nearby ULIRGs without luminous AGN 
(figures 4 and 5 of ref. 19). 

From the spectral energy distribution of HFLS 3, we derive a dust 
temperature of Tiust = 567 7)K, ~10K less than in Arp 220, but ~3 
times that of the Milky Way. CO radiative transfer models assuming 
collisional excitation suggest a gas kinetic temperature of Thin = 
14473°K and a gas density of logyo(m(H2)) = 3.807573 cm”? (Sup- 
plementary Information section 4 and Supplementary Figs 13 and 14). 
These models suggest similar gas densities as in nearby ULIRGs, and 
prefer Tyin >> Tausp Which may imply that the gas and dust are not in 
thermal equilibrium, and that the excitation of the molecular lines may be 
partially supported by the underlying infrared radiation field. This is 
consistent with the finding that we detect H,O and OH lines with upper 
level energies of E/kg > 300-450 K and critical densities of >10°° cm* 
at line intensities exceeding those of the CO lines. The intensities and 
ratios of the detected HO lines cannot be reproduced by radiative trans- 
fer models assuming collisional excitation, but are consistent with being 
radiatively pumped by FIR photons, at levels comparable to those 
observed in Arp 220 (Supplementary Figs 15 and 16)”°”’. The CO and 
H,O excitation is inconsistent with what is observed in quasar host 
galaxies like Mrk 231 and APM 08279+5255 at z= 3.9, which lends 
support to the conclusion that the gas is excited by a mix of collisions 
and infrared photons associated with a massive, intense starburst, rather 
than hard radiation associated with a luminous AGN”. The physical 
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Figure 3 | Gas dynamics, dust obscuration, and distribution of gas and star 
formation in HFLS 3. a, b, High-resolution (FWHM 0.35” X 0.23”) maps of 
the 158-j1m continuum (a) and [C1] line emission (b) obtained at 1.16 mm 
with the PdBI in A-configuration, overlaid on a Keck/NIRC2 2.2-um adaptive 
optics image (rest-frame ultraviolet/optical light). The r.m.s. uncertainty in the 
continuum (a) and line (b) maps is 180 and 400 Wy per beam, and contours are 
shown in steps of 30 and 1a, starting at 5a and 3a, respectively. A z = 2.092 
galaxy (labelled G1B) identified through Keck/LRIS spectroscopy is detected 
~0.65” north of HELS 3, but is not massive enough to cause significant 
gravitational lensing at the position of HFLS 3. Faint infrared emission is 
detected towards a region with lower dust obscuration in the northeastern part 
of HFLS 3 (not detected at <1 um). The Gaussian diameters of the resolved 
[Cu] and continuum emission are 3.4kpc X 2.9 kpc and 2.6 kpe X 2.4 kpc, 


conditions in the ISM of HFLS 3 thus are comparable to those in the 
nuclei of the most extreme nearby starbursts, consistent with the finding 
that it follows the radio-FIR correlation for star-forming galaxies. 

HELS 3 is rapidly assembling its stellar bulge through star formation 
at surface densities close to the theoretically predicted limit for ‘max- 
imum starbursts”’. At a rest-frame wavelength of 158 um, the FIR 
emission is distributed over a relatively compact area with 2.6 kpc 
X 2.4kpce physical diameter along its major and minor axes respec- 
tively (Fig. 3; as determined by elliptical Gaussian fitting). This sug- 
gests an extreme SFR surface density of 2’spr ~ 600 Mgun yr | kpc? 
over a 1.3-kpc-radius region, and is consistent with near-Eddington- 
limited star formation if the starburst disk is supported by radiation 
pressure“. This suggests the presence of a kiloparsec-scale hyper- 
starburst similar to that found in the z= 6.42 quasar J1148+5251 
(ref. 25). Such high 2’spp are also observed in the nuclei of local 
ULIRGs such as Arp 220, albeit on scales two orders of magnitude 
smaller. A starburst at such high Xspp may produce strong winds. 
Indeed, the relative strength and broad, asymmetric profile of the 
OH7II, 2(3/2-1/2) doublet detected in HFLS 3 may indicate a molecu- 
lar outflow, reminiscent of the OH outflow in Arp 2207". 

The identification of HFLS 3 alone is still consistent with the model- 
predicted space density of massive starburst galaxies at z>6 with 
Ssooum > 30 mJy of 0.014 deg * (ref. 7). This corresponds to only 
10 *-10°* times the space density of Lyman-break galaxies at the 
same redshift, but is comparable to the space density of the most 
luminous quasars hosting supermassive black holes (that is, a different 
population of massive galaxies) at such early cosmic times”. The host 
galaxies around these very distant supermassive black holes are com- 
monly FIR-luminous, but less intensely star-forming, with typically a 
few times lower Lpyp than ultra-red sources”’. This highlights the dif- 
ference between selecting massive z > 6 galaxies at the peak of their 
star formation activity through Lpyp, and at the peak of their black-hole 
activity through luminous AGN. The substantial population of ultra- 
red sources discovered with Herschel will be an ideal probe of early 
galaxy evolution and heavy element enrichment within the first billion 
years of cosmic time. These galaxies are unlikely to dominate the star 
formation history of the Universe at z> 6 (ref. 5), but they trace the 
highest peaks in SFR at early epochs. A detailed study of this galaxy 
population will reveal the mass and redshift distribution, number 
density and likely environments of such objects, which if confirmed 
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suggesting gas and SFR surface densities of 2'g,; = 1.4 X 10° Mgun pe? and 
Xorg = 600 Mgunyr kpc ? (~0.6 X 10'? Lean kpe 7). The high Yszp is 
consistent with a maximum starburst at near-Eddington-limited intensity. 
Given the moderate optical depth of tg << ~1 at 158 kum, this estimate is 
somewhat conservative. c, d, Peak velocity (c) and FWHM velocity dispersion 
(d) maps of the [C11] emission are obtained by Gaussian fitting to the line 
emission in each spatial point of the map. Velocity contours are shown in steps 
of 100kms °. High-resolution CO J = 7-6 and 10-9 and HO 321-32 
observations show consistent velocity profiles and velocity structure 
(Supplementary Figs 5-7). The large velocity dispersion suggests that the gas 
dynamics in this system are dispersion-dominated. See Supplementary 
Information sections 3 and 5 for more details. 


in larger numbers may present a stern challenge to current models of 
early cosmic structure formation. 
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Detection of nuclear spin precession is critical for a wide range of 
scientific techniques that have applications in diverse fields includ- 
ing analytical chemistry, materials science, medicine and biology. 
Fundamentally, it is possible because of the extreme isolation of 
nuclear spins from their environment. This isolation also makes 
single nuclear spins desirable for quantum-information proces- 
sing, as shown by pioneering studies on nitrogen-vacancy centres 
in diamond’. The nuclear spin of a *'P donor in silicon is very 
promising as a quantum bit*: bulk measurements indicate that 
it has excellent coherence times®’ and silicon is the dominant 
material in the microelectronics industry. Here we demonstrate 
electrical detection and coherent manipulation of a single *'P nuc- 
lear spin qubit with sufficiently high fidelities for fault-tolerant 
quantum computing*. By integrating single-shot readout of the 
electron spin? with on-chip electron spin resonance’, we demon- 
strate quantum non-demolition” and electrical single-shot read- 
out of the nuclear spin with a readout fidelity higher than 99.8 per 
cent—the highest so far reported for any solid-state qubit. The 
single nuclear spin is then operated as a qubit by applying coherent 
radio-frequency pulses. For an ionized *'P donor, we find a nuclear 
spin coherence time of 60 milliseconds and a one-qubit gate con- 
trol fidelity exceeding 98 per cent. These results demonstrate that 
the dominant technology of modern electronics can be adapted to 
host a complete electrical measurement and control platform for 
nuclear-spin-based quantum-information processing. 

Quantum computers have the potential to revolutionize aspects of 
modern society, ranging from fundamental science to information 
technology’’. The successful demonstration of such a machine requires 
the ability to perform high-fidelity control and measurement of indi- 
vidual qubits—the building blocks of a quantum computer. Errors 
introduced by quantum operations and measurements can be miti- 
gated by using quantum error correction protocols, provided that 
the probabilities of the errors occurring are below certain stringent 
thresholds*. State-of-the-art high-fidelity qubit control and read- 
out has been realized using laser-cooled atoms in electromagnetic 
traps'*’*—a result made possible because of their extreme isolation 
in a near-perfect vacuum. 

Qubits based on solid-state physical systems are attractive 
because of the potential for scalability using modern integrated-circuit 
fabrication technologies. However, they tend to exhibit much lower 
system fidelities owing to interactions with their host environment. 
The ability to combine the control and measurement fidelities of 
trapped atoms with the scalability inherent to solid-state implementa- 
tions is therefore highly desirable. The nuclear spin of a single atom is 
promising in this regard as it is a simple, well-isolated quantum system. 
One of the earliest proposals for quantum computing in the solid state 
advocated using the nuclear spin of individual *‘P dopant atoms in 
silicon to encode and process quantum information’. Silicon is an 


1,15,16 


excellent platform for spin-based quantum-information processing 
because it can be enriched in the nuclear-spin-zero **Si isotope’, 
providing an effective ‘semiconductor vacuum’ and very long spin 
coherence times. Experiments in bulk phosphorus-doped isotopically 
enriched silicon (7°Si:P) have demonstrated the potential of this system 
in which the *'P nuclear spin has been used as a quantum memory’ 
and as a qubit with extraordinarily long coherence lifetimes of over 
180s (ref. 6). However, owing to detection limitations, experiments 
have so far been carried out on only large ensembles of *P nuclei 
typically consisting of several billion nuclear spins®*'*®. To realize 
nuclear-spin-based solid-state quantum computing, one must first 
isolate, measure and control individual nuclear spins. 

It is challenging to measure the state ofa single nuclear spin owing to 
its weak magnetic moment. In the solid state, this has been achieved only 
for the nitrogen-vacancy centre in diamond’, with optical detection, 
and for a rare-earth terbium ion by performing electrical trans- 
port measurements through a single molecule’’. In both these cases, 
a coupled electron spin was used to read out the nuclear spin, which 
requires the ability to measure a single electron spin. The same applies 
to the *'P nucleus: the high-fidelity single-shot readout of the donor- 
bound electron spin’ is at the heart of our nuclear spin readout method. 

We use an on-chip all-electrical detection method for both electron 
and nuclear spins, using a compact nanoscale device” (Fig. 1a) com- 
patible with silicon metal-oxide-semiconductor fabrication standards. 
The device consists of ion-implanted phosphorus donors*', tunnel 
coupled to a silicon metal-oxide-semiconductor single-electron tran- 
sistor (SET). 

The *'P donor in silicon possesses a nuclear spin I of 1/2, and at 
cryogenic temperatures it can bind an electron (with spin S= 1/2) in 
the neutral D° donor charge state. Therefore, a single °'P donor con- 
stitutes a two-qubit system, where the two qubits interact with an 
external magnetic field By in proportion to their gyromagnetic ratios: 
Yn = 17.23 MHzT' forthe nucleus” and y, = giip/h = 27.97 GHzT ! 
for the electron, where g = 1.9985 (ref. 24) is the Lande g-factor, jug is 
the Bohr magneton and h is Planck’s constant. In addition, they inter- 
act with each other through the hyperfine interaction A = 117.53 MHz 
(ref. 24) that arises from the overlap between the wavefunctions of 
the electron and the *!P nucleus. If y.By >> A > 2),,Bo, the eigenstates 
of the two-spin system are approximately (in ascending order of 
energy) || #t), |.4}), |? }) and |} {}), where the thin (thick) arrow indicates 
the orientation of the electron (nuclear) spin (Fig. 1b). The system 
can be transformed to that of a single *'P nuclear spin (Fig. 1c) with 
eigenstates |{}) and |{)) by using the nanostructure device (Fig. 1a) to 
ionize the donor to the D™ charge state. 

The *'P nuclear spin readout experiment begins by performing 
electron spin resonance (ESR) on its bound donor electron’®, using 
microwave pulses delivered by an on-chip broadband planar trans- 
mission line*. The system exhibits two possible ESR frequencies, 
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Figure 1 | Qubit nanostructure, spin transitions and electron spin 
resonance spectra. a, Scanning electron micrograph of the active area of the 
qubit device, showing an implanted donor (donor as red arrow), the single- 
electron transistor (SET) and the short-circuit termination of the microwave 
line. The device is mounted in a dilution refrigerator with an electron 
temperature of ~300 mK, and is subjected to static magnetic fields By between 
1.0 T and 1.8 T. Bo is oriented perpendicular to the short-circuit termination of 
the microwave line (solid orange single-ended arrow), which carries a current 
(solid double-ended arrow) and produces an oscillating magnetic field By 
(represented by the solid and dashed circles) perpendicular to the surface of the 
device. TG, top gate; PL, plunger gate; LB, left barrier; RB, right barrier. 

b, Energy-level diagram of the neutral *'P donor system, with corresponding 
transitions for electron spin resonance (ESR) in blue, and for nuclear magnetic 
resonance (NMR) in red. |: electron spin states; ||{}: nuclear spin states. 

c, Energy-level diagram of the ionized *'P donor, with the single NMR 
transition shown in purple. d, ESR spectra obtained at Bp = 1.77 T by scanning 
the microwave frequency and monitoring the electron-spin-up fraction f;. The 
top trace corresponds to an active v., ESR transition (nuclear spin state |{})) and 
the bottom trace to an active ve. ESR transition (nuclear spin state | {})). 


depending on the state of the nuclear spin: v., ~ 
spin ||)) and ve. ~ y-Bo + A/2 for nuclear spin 
experiment, if we assume that the ESR measurement duration is much 
shorter than the nuclear spin flip time, then we expect only one active 
ESR frequency at any instant. Detecting ESR at the frequency Vey 
therefore indicates that the nuclear spin is in state HW), whereas detec- 
tion at V.2 implies the nuclear spin is | {t). 

Having identified the two resonance frequencies through an ESR 
experiment (see Fig. 1d and also ref. 10), we performed repeated mea- 
surements of the nuclear spin state (Fig. 2a) by toggling the microwave 
frequency Vgsp between v.; and v2, obtaining the electron-spin-up 
fraction f; at each point (see Supplementary Information). If the quan- 
tity Af; =f; (ver) — fi(ve1) is positive, we assign the nuclear state |{}), 
and vice versa. A histogram of Af; (Fig. 2d) shows two well-separated 
Gaussian peaks, corresponding to the two possible nuclear orienta- 
tions, with widths determined by the signal-to-noise ratio (SNR) of the 
measurements (Supplementary Information). The nuclear spin read- 
out error (Fig. 2e) is obtained by fitting the two peaks and integrating 
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a, Repetitive single-shot measurements of the nuclear spin state performed by 
toggling Vgsr between ve; = 49.5305 GHz (dark blue) and v.2 = 49.6445 GHz 
(light blue) and recording the electron-spin-up fraction f;. Each data point 
represents the average f; over 250 single-shot measurements of the electron 
spin (acquired in 260 ms). The electron-tunnelling time is of the order of 100 Ls, 
as reported in ref. 10. The dashed red lines indicate instants when a nuclear spin 
quantum jump has occurred. b, Electron-spin-up fraction difference, 

Af; =f; (Yer) — fi (Ver) for the data in a. Af; > 0 indicates nuclear spin | {}), and 
vice versa. ¢, Af; in an experiment with an increased rate of donor ionization/ 
neutralization, J yon/neut- The greater J ion/neut is achieved by including an 
additional phase in the nuclear spin readout measurement, where resonant 
tunnelling of | |) electrons between the donor and SET occurs (see 
Supplementary Information). d, Histograms of Af; for the data in b, showing 
two well-separated Gaussian peaks, each corresponding to a nuclear spin state, 
as indicated. The counts obtained for —0.015 < Af; < 0.05 are attributed to 
nuclear spin quantum jumps occurring during the measurement. The light and 
dark blue solid lines are Gaussian fits to the data (see discussion in main text). 
e, Readout errors as a function of the detection threshold for Af;. The solid dark 
(light) blue line indicates the SNR-limited error for detecting the ||) (| {})) state, 
whereas the black dashed line indicates the total error. f, Nuclear spin flip rates 
Tyas a function of the donor ionization/neutralization rate I yon/neut- The light 
blue line isa fit to 7} = I + pl ionmeut With p = 1.91(8) X 107 © The dark blue 
line is a constant J’y = 1.54(17) X 10 — ? sl. The red and blue shaded regions 
indicate the values obtained from the ee sets in b and ¢, respectively. The error 
bars represent a 95% confidence level (see Supplementary Information). 
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each Gaussian beyond a discrimination threshold Afiy. At the optimal 
value of Afin = —0.025, the SNR-limited readout error is 2 X 10”. 
We observe that the nuclear spin state remains unchanged for sev- 
eral minutes before exhibiting a ‘quantum jump’ to the opposite state’. 
It is also evident (Fig. 2b) that the nuclear spin is predominantly 
oriented in the |{}) state, which we attribute to an electron-nuclear 
spin flip-flop process, in which the energy difference E;y — E,, (that 
is, between states |{\}) and ||{})) is released to the phonon bath'*”*. 
Because E;y — E\ 4 >> kpT in our experiment, this process acts only in 
the direction |{\})—> || {}) (that is, only spontaneous emission of pho- 
nons occurs), and it cannot be responsible for the observed nuclear 
spin jumps from |{}) to |\)). We have verified that the | {}) > |||) trans- 
ition originates from the readout process, where the donor undergoes 
repeated ionization and neutralization events. These events result in a 


time-varying Hamiltonian, where the exact nature of the nuclear 
eigenstates varies slightly between the neutral and the ionized donor 
case. Accordingly, we observe (Fig. 2c, f) that the lifetime of the nuclear 
spin |{}) is inversely proportional to the rate of ionization and neutrali- 
zation (see also Supplementary Information). 

By exploiting the broadband nature of our on-chip microwave 
transmission line, we perform a nuclear magnetic resonance (NMR) 
experiment (see Supplementary Information for details) on the *'P 
nuclear spin (Fig. 3a). For a neutral donor, we expect two NMR 
frequencies depending on the state of the electron: v,; = A/2 + y,Bo 
when the electron spin is ||) and v,2 = A/2 — y,Bo when the electron 
spin is |}) (Fig. 1b). However, we can also perform an NMR experi- 
ment while the donor is ionized (Fig. 1c), as recently demonstrated ina 
bulk Si:P sample’’. In this case there is only one resonance frequency: 
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Figure 3 | NMR spectra and Rabi oscillations of a single *’P nuclear spin. 
a, Observation of nuclear resonance at By = 1.77 T, while the electron spin is 
initialized | |) (see Supplementary Information). The top inset shows the plunger 
gate voltage waveform (grey line) plus NMR/ESR pulse sequence, and the bottom- 
right inset shows the energy levels involved in the NMR transition. b, Dependence 
of the NMR resonances V9, Vn1 and Vp2 on the magnetic field Bo. Solid lines are the 
values predicted using the *'P nuclear gyromagnetic ratio , = 17.23 MHzT’. 
c, Pulse sequence for the coherent rotation of a *'P nuclear spin with the donor in 
the neutral D” state. Depicted is the plunger gate voltage waveform (grey line) and 
combined NMR/ESR pulses. After a coherent NMR pulse at vp; of duration f,, the 
nuclear spin state is read by probing the v.. ESR transition with 300 single-shot 
adiabatic inversion and electron-spin readout measurements, lasting 
approximately 300 ms. The resulting electron-spin-up fraction f;(V,2) is 
compared to a threshold, extracted from the quantum jump experiment (Fig. 2a), 
and a nuclear spin orientation is ascribed to the measurement. d, Rabi oscillation 
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of the neutral 3'P donor nuclear spin with Pymr = 4mW, Bo = 1.07 T and 

Vn1 = 75.7261 MHz. The pulse sequence of c is repeated 40 times for each 

Rabi pulse length ¢,, with five sweeps of tf, performed to give a total of 

200 measurements at each f,. The number of nuclear spin flips is recorded to give 
the flip probability P,. The solid line is a fit of the form P, = Ksin"(tthravitp)» where 
Kand fai are free fitting parameters. e, Rabi frequency, extracted from fits of data 
similar to that in d, against the square root of the radio-frequency power. 

£, Modified pulse sequence to perform Rabi oscillations on the *!P nuclear spin 
with the donor in the ionized D* state. The electron is removed before applying a 
coherent NMR burst. The electron is then replaced so that a single-shot 
measurement can be performed on the nuclear spin. g, Sample Rabi oscillation of 
the ionized donor nuclear spin using Pywr = 126 mW, By = 1.77 T and 

Vno = 30.5485 MHz, with each data point again obtained from 200 measurements 
of the nuclear spin state. h, Plot showing the linear variation of the ionized nuclear 
spin Rabi frequency with the excitation amplitude. 
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Vno = YnBo. Figure 3b shows the magnetic-field dependence of the 
three NMR frequencies, which agree with the expected values assum- 
ing the bulk 31p gyromagnetic ratio), = 17.23 MHzT ' (ref. 23). Fur- 
thermore, we extract g = 1.9987(6) (see Supplementary Information), 
which is within about 0.01% of the bulk value for Si:P, whereas 
the hyperfine splitting A = 114.30(1) MHz is Stark shifted® from the 
bulk value of 117.53 MHz (ref. 24). The observation of a Stark shift of 
A is important, as it has been proposed° as a mechanism to address 
individual *'P nuclear spin qubits while applying a global micro- 
wave field. 

Having found the NMR frequencies, we use short radio-frequency 
pulses to produce coherent superpositions of the nuclear spin states. 
For the neutral (D°) donor, we first initialize the electron in the 
||) state. A pulse of length ¢, and at the v,; resonance is applied imme- 
diately after, followed by a single-shot readout of the nuclear spin 
state (see Fig. 3c). Measuring the nuclear spin flip probability P, as a 
function of ty results in the coherent Rabi oscillations of Fig. 3d, 
whose frequency frabi scales linearly with the square root of the 
radio-frequency excitation power, Pyyp’” (Fig. 3e). The visibility of 
the oscillations in Fig. 3d is ~60%. Deviations from ideality are most 
probably due to erroneous electron initialization in the excited |f) 
state’®, caused by heating resulting from the train of microwave pulses 
used during readout. We modified the pulse sequence to remove the 
electron before applying the radio-frequency excitation at the V9 
transition (Fig. 3f). The Rabi oscillations on the ionized (D*) *!P 
nuclear spin (Fig. 3g, h) have near-unity visibility, as the electron spin 
state has no bearing on the nuclear resonance frequency while the 
donor is ionized. 

To assess the viability of using the *'P nuclear spin as a quantum bit, 
it is critical to characterize the duration over which coherence is pre- 
served. The dephasing time T,* is obtained from a Ramsey-fringe 
measurement (see Supplementary Information), the NMR pulse 
sequence for which is shown in Fig. 4a (see also Fig. 4b for a Bloch 
sphere state evolution). The decay of the oscillations in Fig. 4c, with 
increasing inter-pulse delay t, is the result of fluctuations in the local 
magnetic environment. Fitting the data with a damped cosine function 
P,(t) = P,,(0)cos(2nAdt)exp(-t/T2*), where P,(0) is the amplitude 
and Ad is the average detuning from resonance, reveals T)*(D°) = 
0.84(10) ms for the neutral donor and T>*(D~) = 3.3(3) ms for the 
ionized donor. These dephasing times are ~10* times longer than 
those measured for the electron spin"®. 

As many of the magnetic fluctuations that contribute to T,* occur 
on much longer timescales than the typical nuclear spin manipulation 
time (~25 ps for a 7 pulse), a significant portion of the dephasing can 
be reversed by using spin-echo techniques (Fig. 4d, e). Observing the 
echo signal while varying the delay t yields the decay curves displayed 
in Fig. 4f. We fit the data with functions of the form y = y(0)exp[(-2t/ 
7) |; where y(0) is the amplitude, b is a free exponent and T, is the 
coherence time. For the ionized donor spin, we extract T,(D*) = 
60.0(9) ms and b(D*) = 1.77(7), both of which are fully accounted 
for by the spectral diffusion caused by dynamics of the *’Si nuclear 
spin bath, as quantified by recent theory**. Accordingly, we expect that 
removal of *’Si through isotopic purification” should yield T, values of 
the order of minutes, as observed in bulk-doped samples’. 

Conversely, for the neutral donor spin, we find a shorter T(D°) = 
3.5(1) ms and b(D®) = 2.2(2). This suggests that additional decoherence 
processes occur in the presence of the donor-bound electron. Charge 
noise at the Si/SiO, interface”’ or electronic gate noise’ could cause a 
time-dependent Stark shift of the hyperfine coupling, resulting in fluc- 
tuations of the instantaneous nuclear Larmor frequency. Ionizing the 
donor thus removes the connection between the electrical noise and 
the nuclear spin precession frequency. This is a static effect, in contrast 
to a recent experiment on the nitrogen-vacancy centre in diamond’, 
where rapid ionization and neutralization improved the coherence of a 
*C nuclear spin through motional averaging. 
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Figure 4 | Ramsey fringes and spin-echo decay. a, NMR pulse sequence for 
the Ramsey-fringe experiment. This sequence replaces the single pulse of 
duration t, in Fig. 3c (Fig. 3f) for the neutral (ionized) donor, whereas the 
nuclear spin is read out in the same way. The phase of both 1/2 pulses is such 
that rotation is performed about the X axis on the Bloch sphere, as noted in the 
rotation angle subscript above each pulse. b, Bloch-sphere representation of the 
evolution in the rotating frame for the Ramsey-fringe measurement. The green 
arrow represents the nuclear spin. The purple path indicates dephasing in 
between pulses, whereas the orange path represents a rotation about the X axis. 
c, Ramsey interference fringes for the nuclear spin with the donor in the D° 
(top) and D* (bottom) charge states, taken at By = 1.77 T. Here a 17/2 pulse was 
12.5 us for the D® experiment and 23.5 1s for the D* experiment. We sweep the 
inter-pulse delay, and repeat the sequence 20 times at each t. Ten sweeps are 
performed (with the total 200 measurements occurring over ~3 min for each 7) 
and the nuclear spin flip probability P,, is found. Fits to the data are discussed in 
the main text. d, NMR pulse sequence for the spin-echo experiment. Here we 
also implement phase cycling, where the final 1/2 rotation is first performed 
about the X axis and the measurement is then repeated with the final 

n/2 rotation about the —X axis. Subtracting the two signals ensures a baseline of 
zero. e, Bloch-sphere representation of the spin-echo measurement. Here the 
final 1/2 pulse is about the X axis (the —X axis is not shown). f, Decay of the echo 
amplitude as the delay t is increased for the case of a neutral (circles) and 
ionized (squares) donor. We perform 40 repetitions of the sequence for each t 
and 25 sweeps, totalling 1,000 measurements at each point. The phase-cycled 
echo amplitude is given by P,(Vp_; —X) - Pa(Vp, X), where P,(Vp_, —X/X) 
represents the nuclear spin flip probability measured at the NMR resonance v, 
with a final 7/2 pulse about the —X or X axis. All other experimental conditions 
are as in the Ramsey-fringe experiment. Fits through the data are discussed in 
the text. 
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We now analyse the fidelity of our solid-state qubit. Single-shot 
nuclear spin readout can be performed with extremely high fidelity, 
owing to the quantum non-demolition (QND)"' nature of the mea- 
surement. The physical phenomena responsible for the observed 
quantum jumps of the nuclear spin originate from the measurement 
through the electron spin, and can be viewed as a deviation from QND 
ideality (see Supplementary Information). For the nuclear |{}) state, 
this results in a lifetime of T; = 1,500(360) s (obtained from extended 
data of the measurement in Fig. 2a). For the nuclear ||}) state, the cross- 
relaxation process, which is caused by phonons modulating the hyper- 
fine coupling, yields Ty = 65(15)s (Fig. 2a). These lifetimes must be 
contrasted with the nuclear-spin measurement time Tineas, Which has 
been optimized here to maximize the nuclear-spin readout fidelity (see 
Supplementary Information). Combining the optimal measurement 
time (Tineas = 104 ms) with the observed nuclear-spin lifetimes yields 
the QND fidelities: Fonp(|{})) = exp(— Tmneas/ Ty) = 0.99993(2); and 
Fenn(|{})) = exp(— Tmneas/Ty) = 0.9984(4). We have therefore obtained 
readout fidelities between 99.8% and 99.99%, the highest reported 
so far for any solid-state qubit, and comparable with the fidelities 
observed for qubits in vacuum-based ion-trap systems’. 

The nuclear spin control fidelity F¢ will ultimately be limited by T> 
and the minimum achievable gate-operation time. We quantify Fc by 
directly measuring the rotation-angle error for the ionized donor 
nuclear spin using multiple-pulse dynamical-decoupling sequences 
(see Supplementary Information). Such a measurement encompasses 
extrinsic sources of pulse error, for example due to slow power fluctua- 
tions of the radio-frequency source. The extracted maximum uncer- 
tainty of 3° for an intended x pulse indicates a lower bound on Fe of 98%. 

The results presented here demonstrate that the nuclear spin of a 
single *P donor could constitute a quantum memory in an electron- 
spin-based quantum computer”®, or a high-fidelity qubit in a quantum 
processor where the nuclear spin is the primary computational element”. 
Future experiments will therefore focus on demonstrating electron- 
nuclear entanglement” and the coupling of multiple nuclei mediated 
by hyperfine and exchange interactions’. We anticipate that exploit- 
ing the *'P nuclear spin qubit will open new avenues for large-scale 
quantum-computer architectures, where the quantum coherence of 
well-isolated atomic systems is combined with the manufacturability 
of silicon nanoelectronic devices. 


METHODS SUMMARY 

Device fabrication. For information relating to the device fabrication, see ref. 10, 
where it is described in some detail. 

Experimental set-up. For our voltage pulses, we employed a compensation 
technique—using an arbitrary waveform generator (Tektronix, AWG520) and 
an inverting voltage preamplifier—to ensure that the pulsing shifted only the 
donor electrochemical potentials while keeping the SET island potential constant. 
The voltage V,, (see Supplementary Fig. 1b) was applied directly to the top gate, 
whereas it was inverted and amplified by a factor K before reaching the plunger 
gate. The gain K was carefully tuned to ensure that the SET operating point moved 
along the top of the SET current peaks, as indicated by the blue arrow in Fig. 1d of 
ref. 9. The SET current was measured by a Femto DLPCA-200 transimpedance 
amplifier at room temperature, followed by a voltage post-amplifier, a sixth-order 
low-pass Bessel filter and a fast digitizing oscilloscope. 

The ESR excitations were produced by an Agilent E8257D microwave analogue 
signal generator and the NMR excitations by an Agilent MXG N5182A radio- 
frequency vector signal generator. The two signals were combined at room tem- 
perature using a power divider/combiner, before being guided to the sample by a 
semi-rigid coaxial cable (2.2m in length). Gating of the ESR/NMR pulses was 
provided by the Tektronix AWG520, which was synchronized with the top gate 
and plunger gate pulses. 
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Unexpected strain-stiffening in crystalline solids 


Chao Jiang't & Srivilliputhur G. Srinivasan? 


Strain-stiffening—an increase in material stiffness at large strains—is 
a vital mechanism by which many soft biological materials thwart 
excessive deformation to protect tissue integrity’ °. Understanding 
the fundamental science of strain-stiffening and incorporating this 
concept into the design of metals and ceramics for advanced appli- 
cations is an attractive prospect. Using cementite (Fe;C) and alu- 
minium borocarbide (Al,;BC3) as prototypes, here we show via 
quantum-mechanical calculations that strain-stiffening also occurs, 
surprisingly, in simple inorganic crystalline solids and confers excep- 
tionally high strengths to these two solids, which have anomalously 
low resistance to deformation near equilibrium. For Fe3C and Al,BC;, 
their ideal shear strength to shear modulus ratios attain remarkably 
high values of 1.14 and 1.34 along the (010)[001] and (0001)[0110) 
slip systems, respectively. These values are more than seven times 
larger than the original Frenkel value of 1/21 (refs 4, 5) and are the 
highest yet reported for crystalline solids. The extraordinary stiffen- 
ing of Fe3C arises from the strain-induced reversible ‘cross-linking’ 
between weakly coupled edge- and corner-sharing Fe,C slabs. This 
new bond formation creates a strong, three-dimensional covalent 
bond network that resists large shear deformation. Unlike Fe3C, no 
new bond forms in Al,BC; but stiffening still occurs because strong 
repulsion between Al and B in a compressed AI-B bond unsettles the 
existing covalent bond network. These discoveries challenge the con- 
ventional wisdom that large shear modulus is a reliable predictor of 
hardness and strength of materials*’, and provide new lessons for 
materials selection and design. 

Vigorous research into the atomistic origin of strength of materials 
continues unabated. It is well known that most materials are weakened 
and ultimately break the more they are strained. At low strains, stress ina 
material is linearly proportional to its strain and the constant of propor- 
tionality is the modulus of elasticity or a material’s resistance to elastic 
deformation. Many structurally complex biological materials’ * are not- 
able exceptions to this rule. Although such materials are initially soft they 
rapidly stiffen at larger strains. This nonlinear elastic response is vital for 
the physiological function of tissues like skin and blood vessels, although 
the underlying mechanism is still not fully understood. 

Here we use quantum-mechanical calculations based on density 
functional theory (DFT) to show and elucidate a surprising strain- 
induced stiffening phenomenon in two structurally simple inorganic 
crystalline solids: orthorhombic Fe,C and hexagonal Al,BC3. Owing to 
strain-stiffening, both materials possess high ideal shear strengths des- 
pite their extremely soft elastic shear moduli c44. The classical Frenkel 
model*? estimates the maximum attainable shear strength tax of a 
perfect crystal, the so-called ideal shear strength, to be tmax ~ G/2m— 
about a tenth of its shear modulus G calculated at small strains. 
However, surprisingly, the t,,,,/G ratios for both Fe;C and Al,BC, 
are greater than 1.1. 

Steels set the “gold standard’ for strong materials and continue to be 
the most important and well-studied structural material in the three 
millennia since the start of the Iron Age. We now know that carbon 
steels are primarily strengthened by the precipitation of excess carbon as 
hard Fe3C carbide (Fig. 1a) instead of soft graphite. The fraction and 
morphology of cementite, for example in pearlite and bainite, sensitively 


control the strength and ductility of carbon steels. Cementite has also 
been proposed to be a major constituent of Earth’s core’. 

In contrast to its immense technological and scientific importance, 
the properties of cementite are still poorly understood owing to dif- 
ficulties in growing crystals large enough and pure enough for experi- 
mentation. An interesting finding of our previous DFT study” is the 
extreme elastic anisotropy of Fe3C, due to its anomalously low shear 
modulus ¢44, which is about tenfold smaller than c;; and c¢¢, and over 
twofold less than the c44 of soft aluminium". Experimental validation 
of our DFT predictions comes from the recent X-ray diffraction stress 
measurements". This extremely low c44 may be indicative of mecha- 
nical instability and weak resistance to shear in the (010)[001] slip 
system, which contradicts the observed predominance and stability 
of cementite in steels!, as well as its high hardness (Vickers hardness, 
~10GPa; ref. 13). The recent nanoindentation experiments“ that 
show a significantly smaller anisotropy in the indentation modulus 
of cementite compared to our DFT predictions’ add further intrigue to 
this puzzle. We note that extremely low c4, and large elastic anisotropy 
may also be found in simple cubic polonium and in ionic compounds 
with NaCl structure’*. A detailed discussion of stability of crystalline 
solids may be found in a recent review'’. 

To resolve these experimental-theoretical controversies, here we 
use DFT calculations to predict the stress-strain relations of ideal, 
defect-free Fe;C crystal to the limit of its structural stability'”*!. 
These calculations quantify the ideal strength of cementite, which sets 
an upper bound on the strength of the material and can be compared 
with results from nanoindentation experiments. Surprisingly, they 
reveal an intriguing reversible, nonlinear elastic response to applied 
shear strain that we attribute to dramatic changes in bonding charac- 
teristics at large strains. These new insights are important inputs to 
physics-informed deformation models. Details of our computational 
methods and discussions of the possible effects of phonon and mag- 
netic instabilities are given in the Supplementary Information. 

Figure 2a shows the stress-strain relations for Fe3C under uniaxial 
tension along various crystallographic directions. For [100] and [001] 
tension, the stress increases monotonically with strain and then drops 
rapidly above critical strains of 16% and 23%, respectively. In contrast, 
for the [010] tension, there exists a noticeable stress plateau above 23% 
strain. The ideal tensile strengths, defined as the first maximum in a 
stress-strain curve, are 22.4GPa, 19.5GPa and 31.3 GPa along the 
[100], [010] and [001] directions respectively. The initial slopes of 
the stress-strain curves give Young’s moduli Ejj00}, Ejoio), and Ejoo1] 
of 283.4GPa, 245.4GPa and 221.4GPa, respectively. These values 
agree well with corresponding moduli of 284.0 GPa, 245.8 GPa and 
220.0 GPa calculated from single-crystal elastic constants’. 

To understand the origin of the stress plateau under [010] tension, 
but absent under [100] or [001] tension, we examine its orthorhombic 
crystal structure (space group Puma), as shown in Fig. la. The 12 Fe 
atoms occupy special (Wyckoff position 4c) and general (Wyckoff 
position 8d) sites. The carbon atoms occupy only special 4c sites. 
The fundamental building blocks of cementite are the Fe¢C trigonal 
prisms, which form a two-dimensional slab parallel to the (010) plane 
via edge- and corner-sharing (Fig. 1a). Interestingly, while there exists 
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Figure 1 | Crystal structure of cementite. a, At equilibrium. Green and red 
spheres represent Fe and C atoms, respectively. The six Fe atoms that are the 
first nearest neighbours of a C atom are labelled Fel to Fe6. b, Inhomogeneous 
internal atomic relaxations in response to a 2% (010)[001] shear strain. The 
arrows give the directions and relative magnitudes of atomic displacements. 


a mixture of covalent Fe-C (due to hybridization between C 2p and Fe 
3d states) and metallic Fe-Fe bonding within each slab, Fe—Fe metallic 
bonds couple adjacent slabs”. Uniaxial tension along the [010] direc- 
tion predominantly elongates the interslab Fe-Fe bonds without 
appreciably stretching the stronger Fe-C bonds (Fig. 2b). This pro- 
duces a ‘metal-like’ deformation of cementite under [010] tensile load- 
ing. In contrast, the abrupt snapping of strong covalent Fe-C bonds 
within FesC trigonal prisms destabilizes cementite under [100] and 
[001] tension and creates a steep stress drop at the critical strain. 
Figure 2c shows the stress-strain curves for Fe,C under shear 
deformation on various slip systems. The initial slopes of these curves 
give C44 C55 and Cg shear moduli of 12.2 GPa, 133.8 GPa and 136.9 GPa, 
respectively, in excellent agreement with our previous study’ 
(C44 = 15 GPa, C55 = 134 GPa and cog = 134 GPa). A particularly sur- 
prising finding is the strain-induced stiffening of cementite. Because 


\Fe3 Fe4 


a38 


Fel Fe2 


Rececccccccescosnssnette Nesensneeeennee’ = 


Fe5 Fee. 


The blue dashed lines indicate Fe7—C1 bonds that will form at large shear 
strains, but are absent in the equilibrium structure. The black dashed lines 
represent the primitive unit cell of cementite; pink shading is used to highlight 
the FegC trigonal prisms as the fundamental building blocks of cementite. 


the c44 elastic shear modulus associated with the (010)[001] slip system 
of cementite is exceptionally low, one would expect its shear strength to 
be correspondingly weak. However, although the slope of the stress— 
strain curve under (010)[001] shear is rather low for small strains, it 
rapidly stiffens with increasing shear strain. While strain-stiffening has 
been observed in many biological materials’ and elastomers”, this is 
the first time such a phenomenon has been predicted in a simple 
crystalline solid, to the best of our knowledge. 

Remarkably, despite the fact that c44 is only 1/10 of cg¢, the ideal shear 
strength of 13.9 GPa for the (010)[001] slip system exceeds the 11.9 GPa 
obtained for the (010)[100] system, and is only slightly smaller than the 
17.1 GPa for the (100)[001] system. Thus, owing to strain stiffening, 
cementite exhibits an unusual combination of extremely high elastic 
shear anisotropy and almost isotropic shear strength. This is in striking 
contrast with, for example, tungsten, which is elastically isotropic at low 
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Figure 2 | Fe3C under tensile and shear deformations. a, Stress—strain curves 
under [100], [010] and [001] uniaxial tension. b, Variations of Fe-Fe and Fe-C 
bond lengths with strain under uniaxial [010] loading. By symmetry, the Cl- 
Fe5 and C1-Fe6 bond lengths are equal to those of C1-Fe3 and of C1-Fe4, 


340 | NATURE | VOL 496 | 18 APRIL 2013 


20 
wz 15 
ao 
© 10 
n 
3 5 —e~— Shear (010)[001] 
Bo —4— Shear (100)[001] 
—=~ Shear (010)[100] 
5 
d . T . T . T . T T 
= 24 *— C1-Fe1 —+— C1-Fe2—=— C1-Fe3 
Log +— C1-Fe4 —e— C1-Fe5 >— C1-Fe6 
P= 
D 2.2 
224 = 
i eee 
§ 2.0 Soa =S ae 
O49 


0.4 


0.2 0.3 
Engineering shear strain 


respectively. Bond lengths are normalized by their equilibrium values. c, Stress— 
strain relations along various shear paths. For the (001)[001] shear, an arrow 
marks the inflection point of the stress-strain curve. d, Changes of Fe-C bond 
lengths as a function of applied (010)[001] shear strain. 
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strains and exhibits a high anisotropy in tensile strengths”. This insight 
resolves the puzzling inconsistencies between some recent experiments: 
although X-ray diffraction stress measurements" at small strains indi- 
cated extreme elastic anisotropy of Fe;C in agreement with our earlier 
DFT predictions’, indentation moduli obtained from nanoindendation 
measurements at large strains are much more isotropic'*. For the 
(010)[001] slip system in cementite, the ratio of the ideal shear strength 
to the associated elastic shear modulus (1.14) is anomalously large— 
about sevenfold larger than the Frenkel value. Note that 1,,.,/G ratios 
are normally between 0.04 and 0.24 (ref. 5). 

In nanoindentation experiments, owing to the small (and hence 
potentially dislocation-free) volumes probed, it is generally believed 
that the shear stress at the onset of plasticity can approach the theor- 
etical shear strength of an ideal, defect-free material”*. Furthermore, 
in situ nanoindentation experiments” in a transmission electron micro- 
scope provided direct evidence that a submicrometre grain of alu- 
minium plastically deformed by nanoindentation to a dislocation 
density of ~10'* m * can also support shear stresses close to the theo- 
retical shear strength. This result is contrary to earlier assumptions that 
a dislocation-free volume is necessary to achieve shear stresses near the 
theoretical shear strength of the material. For Fe,C, the measured 


Fe4 


Fe5 Fe6 


Fe7 
Fe3 
Fe4 
C1 
Fe2 
Fe5 Fe6 


Fe7 


Fe5 Fe6 


Figure 3 | Valence charge density in cementite. Isosurfaces at p = 0.5e AS 
along the (010)[001] shear path are shown for strains y = 0 (a), y = 0.21 (at the 
inflection point of the stress-strain curve; b), and y = 0.38 (just after lattice 
instability; c). Ball-and-stick images in the right column illustrate the variation 
of the local coordination of C atoms with shear strain. The arrows point to 
regions of charge depletion between Fe and C atoms, indicating broken bonds 
(blue dashed lines in the ball and stick images). The grey dashed lines represent 
the primitive unit cell. 
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critical shear stress for incipient plasticity is nearly independent of 
crystal orientation and its mean value (11.9 GPa; ref. 14) is indeed close 
to our calculated ideal shear strengths (11.9-17.1 GPa). Remarkably, in 
nanoindentation experiments", the critical resolved shear stress to 
shear modulus ratio for the (010)[001] slip system was also found to 
be abnormally large (~1.0), which is again in good agreement with our 
present findings. 

The cementite crystal contains many internal atomic degrees of free- 
dom not fixed by space group symmetry. Upon straining, these atomic 
degrees of freedom readjust to minimize the strain energy. Furthermore, 
new atomic degrees of freedom can also emerge due to strain-induced 
reduction in crystal symmetry’. For example, (010) [001] shear deforma- 
tion reduces the symmetry of cementite from orthorhombic to mono- 
clinic (space group P2,/c). This allows the y-coordinates of Fe and C 
atoms at special positions, which are initially fixed by symmetry, to relax. 
As shown in Fig. 1b, C atoms respond to the (010)[001] shear by moving 
mainly along the [010] direction towards a neighbouring slab. As a 
consequence, the distance between C1 and Fe7 atoms in two adjacent 
slabs decreases progressively with increasing shear strain and becomes 
comparable to the average Fe-C bond length in equilibrium Fe;C 
(1.99 A) at about 21% strain (Fig. 2d). Interestingly, this strain also 
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Figure 4 | Shear deformation of Al;BC; along two different slip systems. 
a, The crystal structure of Al;BC;. The a, b and c axes are parallel to the [2110], 
[0110] and [0001] directions of the primitive hexagonal unit cell, respectively. 
Green, blue and red spheres represent Al, B and C atoms, respectively. 
Isosurfaces of valence charge density at p = 0.5e A * along the (0001)[0110] 
shear path are shown for strains y = 0 (b) andy = 0.24 (at the inflection point of 
the stress-strain curve; c). d, Stress-strain relation under (0001)[0110] and 
(0110)[2110] shear. For (0001)[0110] shear, the arrow marks the inflection 
point of the stress-strain curve. e, Variations of normalized bond lengths with 
(0001)[0110] shear strain. By symmetry, the B1-C3, B1-Al6 and C1-Al4 bond 
lengths are equal to those of B1-C2, B1-Al1 and C1-A]3, respectively. 
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coincides with the inflection point of the stress-strain curve separating 
the strain-stiffening and strain-softening regimes. 

At this 21% strain, each C atom now forms seven strong covalent 
bonds with neighbouring Fe atoms, as can be seen from Fig. 3b. A 
three-dimensional covalent bonding network is created in cementite 
because the new Fe7-Cl covalent bonds ‘cross-link’ adjacent slabs. 
This three-dimensional network of Fe-—C bonds, which is absent in 
the equilibrium cementite structure (Fig. 3a), effectively resists shear 
and causes the unexpected strain stiffening in cementite. When 
sheared above 21% strain, the slope of the stress-strain curve begins 
to decrease and the cementite lattice softens and becomes unstable at a 
critical strain of 36%, when both C1-Fel and Cl-—Fe4 bonds break. 
This is evident from the charge depletion in the bond region between 
corresponding atoms (Fig. 3c). Our Mulliken overlap population ana- 
lysis, which gives a direct measurement of bond covalency, provides a 
more quantitative evidence of new bond formation and bond breaking 
in Fe3C under (010)[001] shear (Supplementary Fig. 3). 

Our results naturally lead to the question of whether strain-stiffening 
is present in other inorganic materials. To this end, we have studied 
the shear deformation of hexagonal Al,BC; (space group P63/mmc), 
another layered material (Fig. 4a) with high hardness (Vickers hard- 
ness, ~14 GPa; ref. 26) but anomalously low shear modulus”. Despite 
its low elastic shear modulus c44 (16 GPa from ref. 27 and 18.4 GPa from 
our study), the Al;BC; lattice also stiffens steeply under (0001)[0110] 
shear strain, giving surprisingly high ideal shear strength of 24.7 GPa 
(Fig. 4d). The resulting t,ax/G ratio (1.34) is the highest ever known for 
a crystalline solid. In contrast, the ideal shear strength along the 
(0110)[2110] slip system is only 14.0 GPa, although the related elastic 
shear modulus ce (95 GPa from ref. 27 and 92.6 GPa from our study) is 
over fivefold larger than cy4. 

Unlike cementite, no new bonds form during shearing of Al;BC; 
(Fig. 4b and c). Instead, the atomistic origin of strain stiffening in 
AI;BC; is the strong repulsion between Al and B atoms due to severe 
squeezing of the B1—Al1 and the B1-Al6 bonds by the tilting of the 
rigid linear C-B-C units (Fig. 4e). Eventually, Al,BC; fails under 
(0001)[0110] shear owing to the complete rupture of the C1-All bond 
within the Al.C trigonal bipyramid, after which the equilibrium B1-Al1 
and B1-Al6 bond lengths are restored. Note that whereas cementite is 
ferromagnetic and a metallic conductor’, Al,BC; is a non-magnetic 
semiconductor’’. We anticipate that strain-stiffening may operate in 
other low-symmetry, layered materials with atomic degrees of freedom 
for internal relaxations. 

In conclusion, we have discovered that strain-stiffening, a popular 
phenomenon in many biological materials and elastomers'*”’, also 
unexpectedly occurs in simple crystalline solids. We have identified 
two different strain-stiffening mechanisms operating in inorganic 
solids that confer surprisingly high strength to materials with extre- 
mely low elastic constants. Our work highlights the untapped possibili- 
ties and richness of the strain-stiffening phenomenon in solids, and 
challenges the current philosophy of designing (super)hard materials 
largely on the basis of strong elastic properties*””*°. 


METHODS SUMMARY 


Ab initio ideal strength calculations are performed using the all-electron projector 
augmented wave method within the generalized gradient approximation of 
Perdew, Burke and Ernzerhof (ref. 2 in the Supplementary Information), as imple- 
mented in the Vienna ab initio simulation package (ref. 1 in the Supplementary 
Information). Spin-polarized calculations are performed to account for the fer- 
romagnetic nature of Fe;C. A high plane-wave cutoff energy of 500 eV and dense 
k-point sampling are employed to guarantee high numerical accuracy. To provide 
a more quantitative bonding picture of FesC, we have performed Mulliken bond 
overlap population analysis using the CASTEP code (ref. 5 in the Supplementary 
Information). Further details are given in the Supplementary Information. 


342 | NATURE | VOL 496 | 18 APRIL 2013 


Received 26 August 2012; accepted 6 February 2013. 
Published online 10 April 2013. 


1. Storm, C., Pastore, J.J., Mackintosh, F. C., Lubensky, T. C. & Janmey, P. A. Nonlinear 

elasticity in biological gels. Nature 435, 191-194 (2005). 

Shadwick, R. E. Mechanical design in arteries. J. Exp. Biol. 202, 3305-3313 (1999). 

Meyers, M. & Chawla, K. Mechanical Behavior of Materials Ch. 2 (Cambridge Univ. 

Press, 2008). 

4. Frenkel, J. Zur theorie der elastizitatsgrenze und der festigkeit kristallinischer 

korper. Z. Phys. 37, 572-609 (1926). 

Kelly, A. Strong Solids Ch. 1 (Clarendon Press, 1973). 

Teter, D. M. Computational alchemy: the search for new superhard materials. 

Mater. Res. Bull. 23, 22-27 (1998). 

7. Jhi, S.H., lhm, J., Louie, S. G. & Cohen, M. L. Electronic mechanism of hardness 
enhancement in transition-metal carbonitrides. Nature 399, 132-134 (1999). 

8. Wood, B. J. Carbon in the core. Earth Planet. Sci. Lett. 117, 593-607 (1993). 

9. Jiang, C., Srinivasan, S. G., Caro, A. & Maloy, S. A. Structural, elastic, and 
electronic properties of Fe3C from first principles. J. Appl. Phys. 103, 043502 
(2008). 

10. Wang, H. & Li, M. Ab initio calculations of second-, third-, and fourth-order elastic 

constants for single crystals. Phys. Rev. B 79, 224102 (2009). 

11. Nikolussi, N. et a/. Extreme elastic anisotropy of cementite, FesC: first-principles 

calculations and experimental evidence. Scr. Mater. 59, 814-817 (2008). 

12. Fang,C.M., Sluiter, M. H.F., van Huis, M.A. Ande, C.K. & Zandbergen, H. W. Origin of 

predominance of cementite among iron carbides in steel at elevated temperature. 

Phys. Rev. Lett. 105, 055503 (2010). 

13. Umemoto, M., Liu, Z. G., Masuyama, K. & Tsuchiy, K. Influence of alloy additions on 

production and properties of bulk cementite. Scr. Mater. 45, 391-397 (2001). 

14. Alkorta, J. & Sevillano, J. G. Assessment of elastic anisotropy and incipient plasticity 

in Fe3C by nanoindentation. J. Mater. Res. 27, 45-52 (2012). 

15. Legut, D., Friak, M. & Sob, M. Phase stability, elasticity, and theoretical strength of 

polonium from first principles. Phys. Rev. B 81, 214118 (2010). 

16. Grimvall, G., Magyari-Kope, B., Ozolins, V. & Persson, K. A. Lattice instabilities in 

metallic elements. Rev. Mod. Phys. 84, 945-986 (2012). 

17. Roundy, D., Krenn, C. R., Cohen, M. L. & Morris, J. W. Ideal shear strengths of fec 

uminum and copper. Phys. Rev. Lett. 82, 2713-2716 (1999). 

elling, R. H., Pickard, C. J., Payne, M. C. & Field, J. E. Theoretical strength and 

eavage of diamond. Phys. Rev. Lett. 84, 5160-5163 (2000). 

hacham, H. & Kleinman, L. Instabilities in diamond under high shear stress. Phys. 

ev. Lett. 85, 4904-4907 (2000). 

hi, S. H., Louie, S. G., Cohen, M. L. & Morris, J. W. Mechanical instability and ideal 

hear strength of transition metal carbides and nitrides. Phys. Rev. Lett. 87, 

75503 (2001). 

gata, S., Li, J.& Yip, S. Ideal pure shear strength of aluminum and copper. Science 

98, 807-811 (2002). 

hao, H. E. & Suo, Z. G. Theory of dielectric elastomers capable of giant deformation 

of actuation. Phys. Rev. Lett. 104, 178302 (2010). 

23. Sob, M.,Wang, L. G. & Vitek, V. Theoretical tensile stress in tungsten single crystals 
by full-potential first-principles calculations. Mater. Sci. Eng. A 234-236, 
1075-1078 (1997). 

24. Krenn, C. R., Roundy, D., Cohen, M. L., Chrzan, D. C. & Morris, J. W. Connecting 
atomistic and experimental estimates of ideal strength. Phys. Rev. B 65, 134111 
(2002). 

25. Minor,A.M. etal. Anew view of the onset of plasticity during the nanoindentation of 
aluminium. Nature Mater. 5, 697-702 (2006). 

26. Lee, S. H. et al. Chemical composition and microstructure of AlzBC3 prepared by 
different densification methods. J. Eur. Ceram. Soc. 30, 1015-1020 (2010). 

27. Wang, J. Y., Zhou, Y. C., Liao, T. & Lin, Z. J. First-principles prediction of low shear- 
strain resistance of AlzBC3: a metal borocarbide containing short linear BCz units. 
Appl. Phys. Lett. 89, 021917 (2006). 

28. Teter, D. M. & Hemley, R. J. Low-compressibility carbon nitrides. Science 271, 
53-55 (1996). 

29. Dubrovinsky, L. S. et al. The hardest known oxide. Nature 410, 653-654 (2001). 

30. Kaner,R.B.,Gilman,J.& Tolbert, S. H. Designing superhard materials. Science 308, 
1268-1269 (2005). 


wr 


ano 


oo 
eS) 


ine} 
i so 
OOY=-DO2 


i 
N 
ONN 


Supplementary Information is available in the online version of the paper. 


Acknowledgements C.J. acknowledges the support of a Director’s Fellowship at Los 
Alamos National Laboratory (LANL), where a systematic study of cementite was 
conceived and initiated. S.G.S. acknowledges support from the National Science 
Foundation (grant number 0846444). We also thank J. Wills, M. |. Baskes, A. Caro, 
A. Misra, S. Maloy, A. Srivastava, V. Vitek and S. Ranganathan for their discussions. 


Author Contributions C.J. and S.G.S. contributed equally to this work. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to C.J. (chaopsu@gmail.com) and 
S.GS. (srinivasan.srivilliputhur@unt.edu). 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature12044 


Mapping molecular motions leading to charge 
delocalization with ultrabright electrons 


Meng Gaol*, Cheng Lu!, Hubert Jean-Ruel’”, Lai Chung Liu’?, Alexander Marx, Ken Onda*“, Shin-ya Koshihara”’®, 
Yoshiaki Nakano’, Xiangfeng Shao’+, Takaaki Hiramatsu®, Gunzi Saito®, Hideki Yamochi’, Ryan R. Cooney", Gustavo Moriena’, 


German Sciaini>?* & R. J. Dwayne Miller’? 


Ultrafast processes can now be studied with the combined atomic 
spatial resolution of diffraction methods and the temporal resolu- 
tion of femtosecond optical spectroscopy by using femtosecond 
pulses of electrons’* or hard X-rays'’*” as structural probes. 
However, it is challenging to apply these methods to organic mate- 
rials, which have weak scattering centres, thermal lability, and poor 
heat conduction. These characteristics mean that the source needs 
to be extremely bright to enable us to obtain high-quality diffraction 
data before cumulative heating effects from the laser excitation 
either degrade the sample or mask the structural dynamics”. Here 
we show that a recently developed, ultrabright femtosecond electron 
source’ ° makes it possible to monitor the molecular motions in the 
organic salt (EDO-TTF)2PF, as it undergoes its photo-induced 
insulator-to-metal phase transition’’*. After the ultrafast laser 
excitation, we record time-delayed diffraction patterns that allow 
us to identify hundreds of Bragg reflections with which to map the 
structural evolution of the system. The data and supporting model 
calculations indicate the formation of a transient intermediate 
structure in the early stage of charge delocalization (less than five 
picoseconds), and reveal that the molecular motions driving its 
formation are distinct from those that, assisted by thermal relax- 
ation, convert the system into a metallic state on the hundred- 
picosecond timescale. These findings establish the potential of 
ultrabright femtosecond electron sources’"* for probing the pri- 
mary processes governing structural dynamics with atomic resolu- 
tion in labile systems relevant to chemistry and biology. 
(EDO-TTF).PFs (where EDO-TTF is ethylenedioxytetrathiaful- 
valene) is a quasi-one-dimensional, 3/4-band-filled charge-transfer 
organic salt that undergoes a thermally induced insulator-to-metal 
phase transition at a critical temperature of T, ~ 280 K (ref. 21) and also 
a highly efficient and ultrafast photo-induced phase transition’’*. Its 
electronic structure resembles that of Bechgaard salts, which provided 
the first organic superconductor, (TMTSF) PF, (see ref. 25). The origin 
of its insulator-to-metal phase transition involves a variety of collective 
phenomena”: Peierls-and-Holstein-type electron-phonon” and anti- 
ferromagnetic interactions” , that coexist with order-disorder*’* charge 
localization, and long-range Coulombic interactions have an important 
role in charge disproportionation™. In its metallic or high-temperature 
(HT) phase, electron donor EDO-TTF molecules (D) form columns of 
cations that are separated by sheets of acceptor PF, anions. The distri- 
bution of positive charges among EDO-TTF molecules along the stack- 
ing direction is represented by (D*°°D*°°D*°*D*°*) as shown in 
Fig. 1 (top right)’**. The high positive charge mobility along the cation 
stacks confers metallic properties at room temperature. The holes loca- 
lize below T, and endow the low-temperature (LT) phase with insulating 


properties. In the LT phase, the EDO-TTF molecules present a charge- 
ordered”!! state (D*'D*°D*°D*~?), in which a large bending of the 
neutral moieties promotes the doubling of the unit cell akin to a Peierls 
mechanism in a half-band-filled system, as shown in Fig. 1 (top left). 
Model calculations predict that vertical photo-excitation via the second 
charge-transfer band” leads to a localized {D**D*°D*°D"} excited 
state. Time-resolved optical reflectivity measurements in combination 
with theoretical modelling indicate” that this initial excited state evolves 
in less than 100 fs into a (D"'D*°D*'D*°) charge-disproportionate 
state, which has a lifetime of about 4 ps. Time-resolved optical studies 
in the near- and mid-infrared region identify large charge fluctuations as 
the driving process that finally leads to the complete randomization and 
melting of the charge order after about 100 ps (ref. 24). 

To probe the structural evolution in (EDO-TTF),PF, directly dur- 
ing its photo-induced insulator-to-metal phase transition, we per- 
formed femtosecond electron diffraction (FED) studies employing a 
recently developed ultrabright femtosecond electron source’ ’ that can 


Low temperature 


High temperature 


Figure 1 | Insulator-to-metal first-order phase transition in (EDO- 
TTF)2PF.. Top panels, illustration of the molecular and electronic changes 
associated with the thermal insulator-to-metal phase transition. Bottom panels, 
diffraction patterns for the LT and HT phases, obtained at 230 K and 295 K, 
respectively. The inset shows the assigned Miller indices (h, k, I). The symmetry 
breaking (cell doubling) corresponds to peaks indexed with k = 2n + 1 in the 
LT phase. For details about the definition of unit cell axes, see Supplementary 
Information section 6.1. 
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deliver high-quality diffraction from low-atomic-number materials 
even at the very low repetition rate of only 10 Hz used to avoid the 
adverse effects caused by cumulative heating. FED experiments were 
performed in transmission mode, with the electron beam propagating 
nearly perpendicular to the stacking direction of the EDO-TTF moieties 
to ensure high sensitivity to the most relevant molecular motions, that 
is, those associated with the dynamics of the EDO-TTF moieties and 
PF, counter-ions. Samples were kept at 230 K (for the LT phase) and 
photo-excited by 60-fs, 800-nm light while being monitored by ultra- 
bright femtosecond electron pulses, with 200 or more reflections enab- 
ling us to track time-dependent changes beyond the noise level. See the 
Supplementary Information for detailed information about methodo- 
logy (sections 1-3), sample preparation (section 5), sample orientation 
(sections 6.1 and 6.4), and control experiments (section 4). 

Figure 1 depicts the molecular arrangement of EDO-TTF moieties 
with their respective charge distributions for the LT and HT phases, 
along with the corresponding electron diffraction patterns. The 
appearance of a new family of peaks in the diffraction pattern of the 
LT phase is indicative of a cell-doubling effect, with the clear difference 
between the diffraction patterns of the LT and HT phases confirming 
that the crystal orientation allows the observation of the relevant struc- 
tural changes. 

We monitor the structural evolution of the system by collecting 
diffraction patterns at different time delays after photoexcitation and 
using the pattern of the initial LT phase as a reference, with the result- 
ing difference patterns shown in Fig. 2a. For comparison, Fig. 2b shows 
the difference between the diffraction patterns of the HT and LT 
phases (shown in Fig. 1). Figure 2c gives the temporal evolution of 
normalized intensity changes for some selected Bragg reflections, 
which allow separation of the observed photo-induced structural 
dynamics into a ‘fast’ component in the subpicosecond-to-picosecond 


range and a ‘slow’ structural change proceeding on the 100-ps time- 
scale. The fast dynamics can be explained as femtosecond optical 
excitation altering the charge distribution, modifying the inter- and 
intramolecular forces and driving coherent molecular motions. The 
slow dynamics are attributed to uncorrelated motions that provide an 
ensemble-averaged picture of the structural evolution, that is, a ther- 
mal relaxation process that brings the system to a metallic HT-type 
state**. The transfer of heat to the lattice manifests itself as slow oscilla- 
tions of diffracted intensities (see right panel in Fig. 2c) which are 
probably introduced through the generation of strain waves”. The 
most noteworthy feature is the plateau behaviour observed for time 
delays from about 3 ps to 10 ps (left panel of Fig. 2c), which reflects a 
transient intermediate structure (TIS). Given that optical studies indi- 
cate large charge fluctuations” prevailing in this time span”, it seems 
plausible that the effective intermolecular forces are screened and the 
structure therefore transiently locked. 

To obtain a time-dependent map of the relevant molecular motions 
driving the insulator-to-metal phase transition and the formation of 
the TIS, we developed a structural refinement algorithm based on a 
parameterized molecular model (see equations (4), (6), (7) and (8) in 
Supplementary Information section 7). The model makes use of the LT 
and HT structures as previously determined by X-ray diffraction” and 
considers about 40 Bragg reflections in the refinement calculations. 
The selection of reflections was based on their proximity to the Ewald 
sphere (Bragg condition) and the necessity of removing overlapped 
peaks. 

The entire data set of diffraction images was reduced to the square 
root of intensities, that is, structure factor amplitudes | Fexp (k, t)| 
(besides a k-dependent constant), where k denotes the scattering vec- 
tor (or Miller indices) and t is a given time delay. Given that optical 
excitation causes only a small fraction of unit cells (7...) to undergo 
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Figure 2 | Photo-induced time-dependent structural changes monitored by 
ultrabright FED. a, Differences between the diffraction patterns of the photo- 
induced and the initial LT phases as a function of the time delay between the 
optical excitation and electron probe pulses. b, Difference between the 
diffraction patterns of the HT and LT phases (shown in Fig. 1). ¢, Relative 
intensity changes for a few selected reflections. On the left side of the vertical 
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green line, fast dynamical processes are shown; on the right side, slow 
dynamical processes are shown. The excitation fluence was 0.55 mJ cm” *. The 
sample was photo-excited by 60-fs, 800-nm optical pulses at a repetition rate of 
10 Hz. Error bars represent the standard deviation of measured values within 
the set of ten identical trials. 
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structural changes (see Supplementary Figs 9, 12 and 15), the ob- 
served experimental structure factor amplitudes can be written” as 
|Fexp(k, t)| = |MexeFexc(k, t) + (1 — Mexc)Fur(k)|, where Fexc(k, t) 
is the relevant part showing time-dependent changes and F,,7(k) are 
the structure factor amplitudes of the initial LT phase. The structure 
factor amplitudes are weighted by the fraction of unit cells being 
altered, and remaining in the LT phase, respectively. Our model refine- 
ment calculations compare | Fexp (ks t)| with simulated amplitudes 
|Fsim(, €)| = |excPsim, exc(k, €) + (1 — Mexc)Fsim, ur(k)| and maxi- 
mize the Pearson correlation coefficient, y( | Fexp|, |Fsim|). This figure of 
merit y varies between 0 (no correlation) and 1 (full correlation) and 
was found to provide well-behaved maxima across the full time series 
(see Supplementary Figs 14 and 15). 

The term ¢ is an atomic structural parameter composed of the three 
time-dependent generalized reaction coordinates (Cp, €p, €p) assigned 
to the motion of three independent dynamic groups: flat EDO-TTF 
molecules, bent EDO-TTF moieties and PF, anions, respectively. The 
configuration space was created from a linear interpolation (for each 
dynamic group) between the initial {¢ = (0,0,0)} LT and the final 
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Figure 3 | Reaction coordinates and their temporal evolution in the 
formation of TIS. a, Three unit cells illustrating the arrangement of EDO-TTF 
molecules and PF, counter-ions in the LT phase. Selected dynamical groups are 
illustrated using different background colours in the central unit cell: flat EDO- 
TTF moieties have a green background, bent EDO-TTF moieties have a pink 
background, and PF, anions have a cyan background. Green sticks denote the 
atomic displacements from the initial LT structure to the TIS (unit cell on the 
left) and the HT-type structure (unit cell on the right); displacements were 
magnified 10 and 5 times, respectively, for clarity. The root-mean-square 
displacement between the LT structure and the TIS is 0.18 A and between the 
LT and HT-type structures is 0.44 A. The atomic coordinates of the TIS are 
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{¢€ = (1,1, 1)} HT-type structures (see Supplementary Table 2). We 
believe that the reduction of the problem to such reaction coordinates 
(see Fig. 3a) should capture most of the physics and, at the same time, 
avoid unphysical results as a consequence of over-fitting. A value of 
Nexe ~ 0.1 was obtained by maximizing y(7.,.) in the time interval of 
0-10 ps (see Supplementary Fig. 15) and found to be in very good 
agreement with the estimated number of converted unit cells into 
the HT-type state obtained from equation (5) of the Supplementary 
Information and from the decrease at t = 1 ns in the structure factor 
amplitude averaged over (h,k = 2n + 1,])-reflections (where n is an 
integer) shown in Supplementary Fig. 9. Such reflections disappear in 
the HT phase owing to its higher symmetry (see Supplementary 
Information section 6.2). 

These last two estimations of 7x. rely on the fact that by t = 1 ns the 
system has fully thermalized in a HT-type state and little residual 
thermal stress is left. These are fair assumptions considering that for 
t > 500 ps, the oscillations are damped, the estimated temperature 
increase is less than 50 K and the effect of residual stress is minimized 
for reflections satisfying Bragg’s law”. From optical reflectivity and 


LT = HT (x5) 


Reaction coordinates 


0 5 10 15 
Time delay (ps) 


listed in Supplementary Table 2. b, Temporal evolution of € in a (Cp, &p, €p)- 
Cartesian configuration space. Circles indicate optimized structures. The red 
trace is a multi-exponential fit of the optimum correlation trajectory. The green 
trajectories are two-dimensional projections. c, Temporal evolution of each 
reaction coordinate and its contribution in the formation of the TIS. Error bars 
represent the full-width at half-maximum (FWHM) of the global maximum 
peaks of the Pearson correlation coefficient in reaction coordinate space. Solid 
traces are multi-exponential fits; see equation (9) in the Supplementary 
Information and Supplementary Table 1. See Supplementary Videos for full 
molecular movies highlighting the key molecular motions leading to the 
formation of the TIS state. 
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in situ transmission measurements we estimated that, for the excita- 
tion level used in the experiments, about 8% of the LT unit cells are 
initially photo-excited (see Supplementary Information section 2). The 
above observations indicate that the initial excitonic correlation length 
only extends over about one LT unit cell, promoting local structural 
changes. 

Figure 3a depicts the selected dynamical groups (central unit cell) 
and the atomic displacements connecting the LT structure with the TIS 
(on the left) and the HT-type structures (on the right), respectively. 
The root-mean-square displacement between the LT structure and 
the TIS is 0.18 A and between the LT and HT structures is 0.44 A. 
Figure 3b summarizes the time-dependent changes of ¢, illustrated 
in a (¢;, ¢p €p)-Cartesian configuration space. As can be seen from 
the two-dimensional projections, the motions of flat EDO-TTF moieties 
and of PF, counter-ions are correlated. The temporal evolution of 
each reaction coordinate is also shown in Fig. 3c, for clearness. The 
relative values among reaction coordinates suggest that optical excita- 
tion mostly drives the formation of the TIS through the sliding of 
flat EDO-TTF moieties and the correlated motion of PF, counter-ions. 
This observation is also supported by their respective overshoot values, 
A= €(1ps) — €(5ps), which are indicative of an over-damped half- 
cycle motion. The close proximity of PF, anions to EDO-TTF molecules 
and the observed correlation suggest that steric effects—that is, short- 
range repulsive interactions— are also important in the crystalline order 
and positioning of the anions. 

The addition of nearest-neighbour Coulomb repulsion and electron- 
phonon interactions that modulate site energies through anion displa- 
cements were found to be crucial to reproduce the spectral signature of 
the (D~'D*°D*'D*°) excited state via an extended Peierls—Holstein- 
Hubbard model”’. Our results correspondingly indicate the motion of 
PF, counter-ions as a relevant component of the structural evolution of 
the excited electronic state and the formation of the TIS. Their connec- 
tion with charge fluctuations suggests that PF; counter-ions need to be 
modified in order to gain control over electronic properties in (EDO- 
TTF),PF,, akin to the anion replacement that confers superconductive 
properties at ambient pressure when going from (TMTSF),PF, to 
(TMTSF),ClO, (ref. 25). In contrast, the small relative amplitude for 
the unbending of EDO-TTF molecules in the formation of the TIS 
questions the relevance of this simple bending-type distortion mode 
in the formation of its charge disproportionates. Future FED experi- 
ments will focus on different sample orientations with improved spatial 
resolution and apply more general refinement algorithms for real-space 
reconstruction and ab initio electronic calculations to link dynamical 
structures with time-evolving charge distributions. 
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Terrestrial water fluxes dominated by transpiration 


Scott Jasechko!, Zachary D. Sharp!, John J. Gibson’, S. Jean Birks®*, Yi Yi? & Peter J. Fawcett! 


Renewable fresh water over continents has input from precipita- 
tion and losses to the atmosphere through evaporation and tran- 
spiration. Global-scale estimates of transpiration from climate 
models are poorly constrained owing to large uncertainties in sto- 
matal conductance and the lack of catchment-scale measurements 
required for model calibration, resulting in a range of predictions 
spanning 20 to 65 per cent of total terrestrial evapotranspiration 
(14,000 to 41,000 km? per year) (refs 1-5). Here we use the distinct 
isotope effects of transpiration and evaporation to show that trans- 
piration is by far the largest water flux from Earth’s continents, 
representing 80 to 90 per cent of terrestrial evapotranspiration. On 
the basis of our analysis of a global data set of large lakes and rivers, 
we conclude that transpiration recycles 62,000 + 8,000 km? of 
water per year to the atmosphere, using half of all solar energy 
absorbed by land surfaces in the process. We also calculate CO, 
uptake by terrestrial vegetation by connecting transpiration losses 
to carbon assimilation using water-use efficiency ratios of plants, 
and show the global gross primary productivity to be 129 + 32 giga- 
tonnes of carbon per year, which agrees, within the uncertainty, 
with previous estimates’. The dominance of transpiration water 
fluxes in continental evapotranspiration suggests that, from the 
point of view of water resource forecasting, climate model develop- 
ment should prioritize improvements in simulations of biological 
fluxes rather than physical (evaporation) fluxes. 


100 


Unlike river discharges to the oceans’, the global fluxes of evapora- 
tion and transpiration are poorly constrained owing to a lack of meth- 
odology to decouple these two water fluxes at the catchment scale. 
Stable isotope ratios of oxygen (‘*O/'°O) and hydrogen (*H/"H) in 
water can be used to separate transpiration from evaporation®, because 
the two processes have different effects on these ratios in water. The 
physical process of evaporation enriches residual water in the heavy 
isotopes of oxygen and hydrogen, whereas the biological process of 
transpiration does not produce an isotopic fractionation, assuming 
an isotopic steady state over annual timescales*"'. The pathway water 
takes after falling as precipitation within a catchment includes mixing, 
evaporation (fractionation labelled) and transpiration (non-fractionation 
labelled), until the remaining water accumulates in a downstream lake or 
river. Each of these catchment processes is ultimately recorded by the 
isotopic composition of the lake’s water. We have compiled a data set of 
5'°O and 8H values of large lake waters and capitalize on dissimilar 
isotope effects between evaporation and transpiration to decouple and 
quantify these two freshwater losses from Earth’s surface (isotope 
content is given by (Reamplel Ry. smow — 1) X 10°%o, where R is 
'80/'°O for 5'°O and ?H/'H for 5°H, and V-SMOW represents standard 
mean ocean water). 

To proceed with this calculation, we first report on the stable oxygen 
and hydrogen isotope compositions of Earth’s large lakes (Fig. 1). The 
isotopic compositions of lake waters show a broad range in 5'°O and 
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Figure 1 | 5'8O and 57H values of large lakes and semi-enclosed seas. The 
global meteoric water line’? (GMWL) is shown. The map at top left shows 
catchment areas covered by the data set. The schematic graph at bottom right 


shows water inputs to a lake (diamond) and the evaporation trajectory of a lake 
(percentages refer to evaporation amount). 
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SH values: —23%o to +15%o and —180%o to +80%o, respectively. 
Well-mixed lakes (for example, the North American Great Lakes 
and Lake Baikal) have relatively homogenous stable isotope composi- 
tions, whereas perennially stratified or shallow lakes tend to have 
greater isotopic variability (for example Lake Kivu and Lake Chad). 
Headwater lakes located at high latitudes and altitudes (for example, 
Kluane Lake) have the lowest 8'80 and 87H values, whereas the closed 
basin lakes of eastern Africa have the highest 5'*O and 87H values (for 
example, Lake Afdera and Lake Turkana). Global lakes do not follow 
one systematic evaporation trend, reflecting the unique climatology 
and hydrology of each individual lake catchment. The global meteoric 
water line plotted in Fig. 1 is a regression of 5'°O and 5°H values of 
precipitation samples on a global scale’*. This regression produces a 
5°H/5'*O slope of eight that can be closely reconciled by liquid-vapour 
isotope effects at chemical equilibrium’*. However, the disequilibrium 
process of evaporation results in a strong kinetic isotope effect, with the 
light isotopologues preferentially partitioned into the vapour phase. 
This results in 5°H/85'°O slopes of less than eight, driving the isotope 
composition of lake waters ‘below’ the global meteoric water line. 
Information on the percentage of evaporative losses is retained by the 
difference between the lake’s isotope composition and that of waters 
entering a basin (Oinpus Fig. 1, inset schematic graph). Our global data 
compilation shows that nearly all lakes fall to the right of the global 
meteoric water line in 5'°O-87H space as a result of evaporation, except 
in special cases where waters evaporate upwind and re-precipitate in a 
downwind lake basin (for example Lake Biwa). In what follows, we 
develop equations describing a stable isotope mass balance of waters 
within a lake catchment to estimate the percentage of catchment trans- 
piration, and apply these equations to 8'°O and 8°H data for large lake 
waters. 

A lake catchment in hydrologic steady state can be described by a 
balance between water inputs (J, precipitation and inflows from 
upstream lakes) and water losses such as precipitation intercepted by 
vegetation (xP, where x is the fraction of intercepted precipitation for 
the catchment), open-water and soil evaporation (E), transpiration (T) 
and liquid losses to rivers or groundwater discharge (Q): 


I=xP+E+T+Q (1) 


Similarly, a stable isotope mass balance of a lake catchment is obtained 
by considering the isotopic composition of each water flux (6;, dz and 
so on): 


By combining equations(1) and (2), we develop a new equation 
describing transpiration losses from a catchment: 


_ 1(d;—5z) — Q(6q — dz) —xP(5p — dz) (3) 
— Or —OE 


Each parameter in equation (3) can be estimated from gridded data 
sets or lake-specific studies, except for the isotope composition of 
evaporate (dz). To calculate 6, (ref. 14), we use an evaporation model 
based on laboratory-derived liquid—vapour fractionation factors’’. The 
isotope composition of soil and open-water evaporate are grouped into 
one term (dz), and the isotopic composition of transpired moisture 
(67) is calculated using both shallow and deep-water sources under the 
assumption that the catchment is in steady state at an annual time 
step'' (on average, water molecules spend multiple years within lakes 
examined here). Physical, isotopic and hydroclimatic data sets for each 
lake are compiled from available reanalysis and interpolated data’*"* 
and are used in equation (3) to calculate catchment transpiration 
(Methods). 

We find that transpiration accounts for more than two-thirds of 
total surface water evapotranspiration for 85% of the catchments 
examined (Fig. 2 and Fig. 3a). Remarkably, transpiration also accounts 
for the majority of evapotranspiration in desert catchments (average, 
75%; range, 35% to 95%). In situ transpiration measurements in 
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deserts range from 7% to 80% of evapotranspiration”’, in large part 
because precipitation rates are highest in the headwaters of desert 
catchments, thereby increasing the importance of these forested eco- 
systems to the catchment’s water balance. Transpiration rates range 
from less than 100 mmyr ‘ to approximately 1,300 mm yr’ ' (Fig. 2 
and Supplementary Information, section 1). Our results show that 
even though open-water evaporation may locally occur at higher rates 
than transpiration, the fraction of total evapotranspiration represented 
by evaporation is severely limited by the small areas of open water on 
Earth’s continents (approximately 3%, globally”). Therefore, we posit 
that the biological pump of water into the atmosphere during photo- 
synthetic gas exchange (that is, transpiration), rather than the physical 
process of evaporation, dominates water losses from the continents. 
Because plant roots are able to tap into groundwater and soil-water 
reservoirs, transpiration effectively moves deep sources of water into 
the atmosphere, whereas evaporation is only effective for water at or 
near the surface, which explains the very high proportion of transpira- 
tion to the overall evapotranspiration flux. 

Our results are supported by a cross-plot comparison of isotope-based 
and conventional open-water evaporation rates (R” = 0.78 (squared 
correlation coefficient), slope = 0.92; Supplementary Fig. 1); geographic 
similarity between compiled in situ transpiration measurements””® (of, 
for example, sap flow) at the forest stand level and our estimates (Fig. 3); 
and agreement between '%O/'°O- and *H/'H-based evaporation rates 
using equation (3) (R* = 0.78, slope = 0.94; Supplementary Figs 2 and 3 
and Supplementary Information, section 1). We also note that the time 
step of our calculated transpiration fluxes ranges from 1 to 1,000 years, 
averaging 40 years, as dictated by the hydrologic residence time of each 
lake (Supplementary Table 3). To scale up our calculation to Earth’s 
ice-free land surface, and to provide a fourth check corroborating our 
catchment transpiration results, we estimate the global transpiration 
from Earth’s landmasses (excluding Antarctica) from a stable isotope 
mass balance of Earth’s entire freshwater reservoir. This estimate is 
based on the deuterium excess parameter, which includes informa- 
tion contained in both '80/'°O and *H/'H (d= 5°H — 88'8O). We 
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Figure 2 | Transpiration water losses for 56 lake catchments grouped by 
ecoregion ('°O/'°O-based results). Each coloured bar represents results for a 
single lake catchment. Extents of bars show 25th and 75th percentiles of Monte 
Carlo simulations. Median transpiration (T; square) outputs of Monte Carlo 
simulations and total evapotranspiration losses (solid line) are shown. The 
median result is close to the total evapotranspiration for most of the lakes, 
demonstrating the dominant role of transpiration in total evapotranspiration 
losses. 
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Transpiration relative to evapotranspiration 


Figure 3 | Transpiration and carbon fluxes within 73 lake catchments. 

a, Transpiration losses as a percentage of total evapotranspiration. 

b, Transpiration rates. c, Gross primary productivity for 10% of Earth’s 
continental area. Coloured diamonds are shown for small basins as a visual aid. 
Inverted triangles represent compiled in situ transpiration measurements (for 
example sap flow’). 


obtain a similar expression to equation (3) based on the deuterium 
excess: 


P(dp 


dz) —Q(dg—dz) 


dr —dz 


xP(dp dr) 


p= 


(4) 


Terrestrial precipitation (P= 110,000 + 10,000 km? yr | (ref. 17), 
dp = 9.5 + 1%o (ref. 12)) is the only input of water to the continents, 
and water is lost through river discharges to the oceans (Q= 
37,300 + 700 km? yr! (ref. 7), dg = 6.8 + 3.8%o; Supplementary Infor- 
mation, section 2), terrestrial evaporation (dg = 75 + 30%p (this work)), 
transpiration (dr = 8+ 3%) or interception by vegetation (xP = 
7,500 + 1,500 km? yr! (refs 16, 17), dp = 9.5 = 1%o (ref. 12)). Solving 
equation (4) shows that transpiration accounts for 80% to 90% of ter- 
restrial evapotranspiration (respectively the 25th and 75th percentiles of 
a Monte Carlo sensitivity analysis). Volumetrically, transpiration con- 
verts 62,000 + 8,000 km’ yr_' of liquid water into atmospheric vapour, 
requiring 33 + 4 Wm ” of latent heat, or roughly half of all solar energy 
absorbed by the continents”! (approximately 70 W m_”). Results show 
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that 90% of precipitation falling on land’* (111,000 km’ yr‘ *) is already 
appropriated to ecosystems for either primary production (62,000 + 
8,000 km? yr’) or as aquatic habitat in rivers’ (37,000 km? yr’), an 
important consideration for diversion of in-stream flows. 

We can use our transpiration fluxes to calculate carbon assimilation 
by terrestrial vegetation by linking the water and carbon cycles”. The 
molar ratio of CO; assimilated during photosynthesis to transpired 
H,0 is known as water-use efficiency (WUE) and is dependent on a 
variety of factors including the type of photosynthetic pathway used by 
a particular plant species (C3, Cy or CAM) and atmospheric conditions 
such as vapour pressure deficit and CO, concentration”*. We compile 
WUE data and couple these to atmospheric vapour pressure deficits 
and C3-C, vegetation abundances to develop global grids of WUE 
(Methods). Catchment transpiration fluxes and gridded WUEs are 
applied to calculate gross primary production within each catchment 
(Fig. 3c). On the global scale, we weight our global grids of WUE 
according to vegetation density and calculate the global WUE of the 
terrestrial biosphere to be 3.2 + 0.9 mmol CO; per mol H30. Applying 
this ratio to our global transpiration flux (62,000 + 8,000 km? yr | of 
H,0), we calculate gross primary production to be 129 + 32 GtC yr’. 
This flux is consistent with a recent estimate of 123+ 8GtCyr * 
(ref. 6) and provides a fifth line of support for the large transpiration 
fluxes reported in our work. 

Linkages made here between the water and carbon cycles highlight a 
new stable-isotope-based methodology that can be used to monitor 
and map ongoing changes to Earth’s water cycle**’° as well as modi- 
fications to carbon assimilation rates under increased atmospheric 
temperatures and CO, concentrations. Our results show that the water 
and carbon cycles are linked in such a way that transpiration must 
account for more than 80% of continental evapotranspiration to main- 
tain a mass balance between these two biogeochemical fluxes (plant 
transzpiration and CO, uptake). Given the importance of transpira- 
tion, it follows that the physiological response of vegetation to a war- 
mer and CO,-enriched atmosphere will have a dominant effect on 
future changes to evapotranspiration and the terrestrial hydrological 
cycle. Furthermore, changes in natural ecosystems via land-use modi- 
fications or climate changes will have notable effects on river discharges 
and, consequently, fluvial sediment loads, chemical weathering on con- 
tinents, and atmospheric latent heat transport. 

Climate change is expected to affect global transpiration’. Conside- 
ring the dominance of transpiration in continental evapotranspiration 
shown here, future changes in global transpiration will affect land 
temperatures by altering latent heat fluxes from continents, and will 
also change the fraction of precipitation entering rivers. Our catch- 
ment-scale results can be applied as a calibration tool for climate 
models, which should shift the prevailing focus on physical climate 
data towards ecosystem water requirements and so result in better 
predictions of continental evapotranspiration and water recycling in 
a warmer future climate. 


METHODS SUMMARY 


In equation (3), hydrologic inputs (J) include precipitation'” and upstream lake 
inflows. Catchment losses include interception (xP; ref. 16), liquid outflows (Q; 
lake-specific data), evaporation (E) and transpiration (T). The isotopic composi- 
tion of precipitation (dp) is computed from monthly grids'’ weighted spatially 
(grid cell, i) and temporally (month, j) to precipitation distribution as follows: 


= P; (5) 
2D , 


For chain lakes, inflows from an upstream lake are included in the calculation of 6;. 
For lakes not in equilibrium with current climate, 5; is calculated from the inter- 
cept between a computed evaporation trend” and the global meteoric water line’. 
The isotopic composition of lake water is used to estimate that of liquid outflows 
(that is, 6g = djake). The isotopic composition of transpired moisture (67) is esti- 
mated by weighting the isotopic composition of precipitation’” spatially (i) to 
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mean long-term normalized difference vegetation indices (NDVI; proxy for 
vegetation density). A range of two temporal (j) weighting approaches’® is used 
for 67, one weighted to growing season (NDVI values (equation (5)) and another 
to monthly precipitation (equation (6)): 


12 q 
. 1 Soar NDVIj6p 
Ors a / ) NDVI; 6 
T-SHALLOW ye, NDVI, > ( =o NDVI, (6) 


: 1 7 
OT—DEEP i 
NDVI; 


NDVI; (7) 


i=1 ‘ 
i 


Finally, the isotopic composition of evaporate is computed using an evaporation 
model": 


(Otake — &*) /o* —hda — ex 
l—h+ex 


og (8) 
This takes into account temperature-dependent’” equilibrium’? («*; e* = «* — 1) 
and humidity-dependent” (/) kinetic (éx; ref. 27) fractionation factors normalized 
to lake temperatures”. The isotopic composition of atmospheric vapour (64) is 
computed on the basis of a precipitation-equilibrium assumption’*'*'”° and by 
using isotope-enabled climate model grids'*. Humidity, temperature and 6, are 
each weighted to the monthly evaporation amount. A Monte Carlo simulation is 
used to assess calculation uncertainty embedded in each input parameter 
(Supplementary Information, sections3 and 4). Plant water-use efficiency is 
calculated by applying growing-season daytime vapour pressure deficit!’”° 
(VPD) to C; (WUEc3 = 4.21(VPD) °®’ mmol CO, per mol H,O) and Cy 
(WUEcy = 6.91(VPD) °° mmol CO per mol HO) vegetation abundances”. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


For each basin, eight input terms are required for calculation of transpiration 
losses using equation (3): I, Q, x, dq, 6p O7, Op and dg. Water inputs (J) include 
precipitation and upstream lake inflows. Precipitation inputs are obtained from 
high-resolution physical climate grids'’. Upstream chain lake inflows are retrieved 
from lake-specific sources, as are liquid outflows (Q) for each lake. The proportion 
of incident precipitation that is intercepted and returned to the atmosphere (x) is 
obtained from satellite-based gridded data’®. The isotope composition of liquid 
outflows (dQ) from each lake is obtained from epilimnion 3!80 and 8H values for 
samples nearest to a lake’s outflow (djake = 6Q). The isotope composition of pre- 
cipitation entering a basin is calculated by weighting spatially (grid cell, i) and 
temporally (month, j) to monthly precipitation amount (P) as follows: 


12 
; 1 ta (D5 -1 Fide 
6 = Z P; 5 


Monthly 6p estimates'® and monthly precipitation amounts are obtained from 
gridded data sets. For headwater lakes, precipitation is the sole input (that is, = P 
and 6; = 6p); for chain lakes, dp is flux-weighted against the isotopic composition 
of riverine inputs entering the catchment from upstream chain lakes. Large lakes 
with residence times longer than approximately 300 years (for example, Lake 
Baikal) are in an isotopic disequilibrium with current climate. For these lakes, a 
3°H/8'%O evaporation slope is calculated”* using the lake isotope data. The inter- 
cept of the resulting ‘evaporation line’ with the global meteoric water line’? is applied 
as amean ‘long-term’ estimate of 6;. This approach is also applied to headwater lakes 
where grids produce unrealistic 6, estimates (6; > jake). Lakes with an outflow that 
periodically reverses flow (Tonlé Sap and Poyang Lake) or mixes with a geographi- 
cally separate lake (Lake Michigan and Lake Huron) are treated specially, and the 
isotopic composition and flux of return flows are included in the computation of 67. 

The isotopic composition of transpired moisture is calculated using an average 
of two approaches (both based on similar dp grids’), the first representing the 
isotope composition of shallow waters during the growing season and the second 
estimating the annual isotope composition of recharge. Plants tapping shallow 
water sources draw on precipitation falling during the growing season, whereas 
deep-rooted vegetation transpires ground waters that more closely represent the 
recharge-weighted isotope composition of precipitation”. 

To estimate the isotope composition of transpired moisture (6) for shallow 
“growing season’ waters, the isotope composition of precipitation (dp) is weighted 
to a proxy for chlorophyll abundance using long-term monthly mean values of 
normalized difference vegetation indices (NDVI; equation (5)). Monthly NDVI 
values less than zero were assigned a value of zero, because these cells host minor 
amounts of photosynthetic activity: 


i 2 (So), NDVI», 
6 T—SHALLOW n = 
oie NDVI i _, NDVI, 


i=1 ‘ 
i 


NDVI; (6) 


To estimate the isotope composition of transpired moisture from vegetation with 
deep roots, we weight the isotope composition of precipitation temporally to 
monthly precipitation amount, and then consider the spatial distribution of 
vegetation by applying long-term annual average NDVI indices: 
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1 
The range of values from the two approaches is used to estimate 6d, (see 
Supplementary Information, section 3 for treatment of uncertainty in each input 
parameter). 

The isotope composition of evaporated moisture is estimated using an evap- 
oration model developed to calculate the isotopic composition of evaporate’: 


(Otake —e*) /o* —hda — €K 


} 
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(8) 


where diake is the isotope composition of each lake, «* is the temperature- 
dependent” equilibrium liquid—vapour fractionation factor’’ (with e* = «* — 1), 
h is the relative humidity’? normalized to surface water temperatures and 
& = Cx(1 — h) isa kinetic separation factor with Cx representing resistance ratios 
for various stable water isotopologues (Cx = 14.2 for the 3'80 model and 
Cx = 12.5 for the 8°H model”’). 

To calculate 6,, four inputs are required: atmospheric specific humidity, lake 
water temperature (T,), air temperature (T,) and the isotopic composition of 
atmospheric moisture (6,). Specific humidity values are calculated”* using long- 
term monthly mean temperature and relative humidity grids'” (ha), lake surface 
temperatures are used to calculate monthly saturation vapour pressures, and then 
relative humidity is computed using the specific humidity and computed satura- 
tion vapour pressure. The isotope composition of atmospheric water vapour (64) 
is obtained in two ways. First, the isotope composition of atmospheric moisture 
can be estimated by assuming that the isotope composition of precipitation reflects 
the isotopic atmospheric vapour offset by equilibrium isotope effects”* (calculated 
with monthly gridded 6p values'* and air temperatures”, applied to liquid~vapour 
equilibrium fractionation factors'*). Alternatively, values for 6, are derived from 
long-term monthly average outputs from an isotope-enabled global climate model 
(IsoGSM'*). An average 5, value is taken from the two approaches and used as a 
first estimate (Supplementary Information, section 3). Finally, atmospheric 
vapour 8180 and 87H values, humidity, and lake and air temperatures are weighted 
temporally to monthly evaporation amount for each lake. This is a crucial step for 
lakes that experience large seasonal variations in evaporation fluxes. Uncertainty 
in each input parameter is assessed by a Monte Carlo analysis (Supplementary 
Information, sections 3 and 4). Plant-scale water-use efficiency grids are developed 
for Earth from a compilation of data sets (Supplementary Table 6 and 
Supplementary Figs 5 and 6). Daytime vapour pressure deficit (VPD) is calculated 
at a monthly time step using humidity'” and daytime temperature (maximum 
daily temperature minus average daily temperature”’), and is weighted to growing 
season via NDVI indices. These growing-season daytime vapour pressure 
deficits are input into VPD-WUE regressions from compiled data for C3 
(WUEc;3 = 4.21(VPD) °°’ mmol CO, per mol H,O) and Cy (WUEcs = 
6.91(VPD) °*° mmol CO, per mol HO) vegetation, and the respective propor- 
tions of C; and Cy, plants within each grid cell*® are applied to develop WUE grids 
for Earth. WUE grids are applied to catchment transpiration fluxes to calculate 
gross primary production within each lake basin. 
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Earliest evidence for the use of pottery 


OnE: Craig", H. Saul?, A. Lucquin!, Y. Nishida’, K. Taché!, L. Clarke**, A. Thompson’, D. T. Altoft!, J. Uchiyama®, M. Ajimoto’, 


K. Gibbs®, S. Isaksson”, CG. P. Heron? & P. Jordan" 


Pottery was a hunter-gatherer innovation that first emerged in East 
Asia between 20,000 and 12,000 calibrated years before present” 
(cal BP), towards the end of the Late Pleistocene epoch, a period of 
time when humans were adjusting to changing climates and new 
environments. Ceramic container technologies were one of a range 
of late glacial adaptations that were pivotal to structuring sub- 
sequent cultural trajectories in different regions of the world, but 
the reasons for their emergence and widespread uptake are poorly 
understood. The first ceramic containers must have provided pre- 
historic hunter-gatherers with attractive new strategies for proces- 
sing and consuming foodstuffs, but virtually nothing is known of 
how early pots were used. Here we report the chemical analysis of 
food residues associated with Late Pleistocene pottery, focusing on 
one of the best-studied prehistoric ceramic sequences in the world, 
the Japanese Jomon. We demonstrate that lipids can be recovered 
reliably from charred surface deposits adhering to pottery dating 
from about 15,000 to 11,800 cal sp (the Incipient Jomon period), 
the oldest pottery so far investigated, and that in most cases these 
organic compounds are unequivocally derived from processing 
freshwater and marine organisms. Stable isotope data support the 
lipid evidence and suggest that most of the 101 charred deposits 
analysed, from across the major islands of Japan, were derived from 
high-trophic-level aquatic food. Productive aquatic ecotones were 
heavily exploited by late glacial foragers’, perhaps providing an 
initial impetus for investment in ceramic container technology, 
and paving the way for further intensification of pottery use by 
hunter-gatherers in the early Holocene epoch. Now that we have 
shown that it is possible to analyse organic residues from some of 
the world’s earliest ceramic vessels, the subsequent development of 
this critical technology can be clarified through further widespread 
testing of hunter-gatherer pottery from later periods. 

On the basis of evidence from East Asia, North Africa, the Americas 
and Northern Europe, it is now widely accepted that pottery produc- 
tion precedes the emergence of farming and should be considered a 
hunter-gatherer innovation. From radiocarbon dates of an archaeolo- 
gical sequence excavated at Xianrendong Cave, Jiangxi Province, 
China, it has been proposed that ceramic container technology extends 
in age to about 20,000-19,000 cal BP’, some 10,000 yr before the earliest 
evidence of rice cultivation in that region. Regional case studies*” have 
explored the origins, adoption and dispersal of ceramic container 
technology, dates are being refined and pushed back, and models are 
emerging that encompass economic, technological, social and environ- 
mental factors that both facilitated, and potentially inhibited, pottery 
innovation in different parts of the world’. Yet the actual uses of these 
early pottery vessels have not been addressed beyond assumed func- 
tional roles such as cooking and storage, or speculative associations with 
available animal and plant resources’ ”. 

An understanding of how pottery was used is necessary to elucidate 
both the social and the economic context for its innovation and also to 


explain why the technology flourished in some hunter-gatherer groups 
and not others. Direct evidence of pottery use can be gained through 
molecular and stable isotope analyses of lipid residues and other organic 
matter preserved in association with ceramics'®’. Although such 
investigations have been widely conducted on vessels associated with 
early agricultural or pastoral communities'*”’, the use of earlier hunter- 
gatherer pottery has been largely ignored or considered only for late 
forager groups'*’’. The main challenge limiting systematic investi- 
gations of early hunter-gatherer pottery use is the extent to which 
molecular evidence is preserved after such long-term exposure to the 
burial environment. The earliest known pottery use in Eurasia gradually 
developed, probably independently, in Southern China, the Amur River 
basin (Russian Far East) and Japan at the end of the Late Pleistocene’”, 
necessitating the recovery of organic residues at least 3,000 yr older than 
has so far been reported’’. The rarity of the very earliest pottery vessels 
also limits widespread destructive residue analysis. Here we present an 
investigation of Late Pleistocene pottery use, focusing on the Japanese 
ceramic sequence known as the Incipient Jomon, using small sub- 
samples (~20 mg) of charred deposits adhering to the inside and 
outside of potsherds. We investigated the carbon and nitrogen stable 
isotope characteristics of the charred deposits using bulk isotope ratio 
mass spectrometry (IRMS). Where the sample was large enough, 
lipids were extracted and analysed by gas chromatography mass spec- 
trometry (GCMS) and GC combustion IRMS (GC-C-IRMS) to deter- 
mine their structural and carbon isotope characteristics'®”’. 

Incipient Jomon pottery is thought to have been invented in Japan 
~16,000 cal Bp’, and became established across the major islands of the 
Japanese archipelago during the last 4,000 yr of the Late Pleistocene’®. 
Much is known about the glacial foragers who developed this pottery. 
Their sites have been well dated’’ and the pottery sequence has been 
intensively studied’: three major phases of typological development have 
been identified, each broadly mapping onto different climate phases of 
the final Pleistocene (Fig. 1). Japanese Late Pleistocene pottery con- 
tainers were produced in only very small quantities (Fig. 1b) and their 
estimated volumes were also relatively low. Therefore, pottery may 
have initially been reserved for specialized subsistence activities or 
ritual use, rather than being a widespread technology integral to every- 
day culinary practices. Nevertheless, this long-lived nascent phase, which 
in some regions endured severe climatic fluctuations, led to an extremely 
rapid expansion in pottery production in the early Holocene, possibly 
in response to a warming climate, demographic expansion and intensi- 
fication of resource exploitation. 

Charred surface deposits were sampled from 101 Incipient Jomon 
vessels from 13 sites, dating from 15,300-11,200 cal sp (Supplementary 
Table 1) and encompassing a wide range of different environments 
across the archipelago (Fig. 1a). Potsherds from different phases (II and 
III; Fig. 1b) broadly corresponding to the Bolling-Allerad interstadial 
and the Younger Dryas stadial were included. Variations in vessel 
shapes are known to exist in this period”’* but could not be discerned 
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Figure 1 | Locations of sites sampled and temporal changes in early pottery 
production against Greenland ice-core oxygen isotope record. a, Potsherds 
were selected from assemblages at different sites, dating to chrono-typological 
phase II, that is, linear-relief wares (orange), and phase III, that is, punctate- 
marked, nail-impressed and cord-marked wares (green). At Torihama, pottery 
from both phases was sampled. The coastline corresponding to the Younger 


because of the fragmentary nature of the assemblages. Bulk carbon 
(3'°C) and nitrogen (8'°N) isotope ratio values obtained from each 
charred deposit were variable across the whole sample and between 
individual sites (Fig. 2a). Despite this variability, more than 75% of the 
samples had 3!°N values greater than 9%o (median, 10.8%o) consistent 
with the processing of high-trophic-level aquatic (that is, marine or 
freshwater) products, as has been observed experimentally (Fig. 2a) 
and on pottery from different archaeological sites associated with the 
exploitation of fish and marine mammals’*'*”. To confirm this inter- 
pretation, lipids were extracted from 57 charred interior deposits and 
four exterior deposits from Incipient Jomon vessels and analysed by 
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Figure 2 | Bulk carbon and nitrogen stable isotope ratios and fatty acid 
stable carbon isotope ratios from Incipient Jomon pottery. a, Bulk stable 
isotope values are discriminated according to site location and period. 
Torihama, orange diamonds; Shinano river sites, green circles; Sankakuyama I, 
red squares; Taisho 3, blue stars. Filled symbols indicate phase III and open 
symbols indicate phase II. The blue and green vertical bars show 95% 
confidence intervals and the median stable nitrogen isotope value from 
experimentally charred aquatic (blue, m = 9) and terrestrial (green, n = 8) 
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Dryas stadial (~65 m below present-day sea level”’) is shown. b, The Greenland 
Ice Sheet Project 2 ice-core 6'*O values (blue) generally reflect temperature, 
with lower 5'8O values corresponding to lower temperatures. Potsherd 
quantities (grey) are as previously reported”’®, calculated from 52 site reports 
and chronologically assigned by their typology and by 118 associated *C dates. 


GCMS (Table 1). Seventeen samples from the Torihama site, Western 
Honshu, and one from the Taish63 site, Hokkaido, produced dis- 
tinctive lipid profiles typically consisting of medium- and long-chain 
saturated (C,4-C,,) and mono-unsaturated (Cj6.;-Cy2.,) fatty acids, 
isoprenoid fatty acids and long-chain (Cjg—C22) w-(o-alkylphenyl) 
fatty acids (Fig. 3 and Supplementary Table 2). These profiles are 
typical of degraded aquatic oils, meeting the established criteria for 
their identification in archaeology” and providing compelling evid- 
ence that freshwater and/or marine foods were processed in these 
vessels. Such lipid profiles were not observed in the four exterior sur- 
face deposits and are highly unlikely to derive from the depositional 
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organisms (Supplementary Table 3). b, Plot of the 5C values of Cy¢.9 and Cis. 
fatty acids from charred deposits on pottery from Torihama and Taisho 3. 
These are compared with reference fats from wild terrestrial non-ruminant 
(n = 5), wild ruminant (n = 10), freshwater (n = 9) and marine (n = 17) 
organisms (Supplementary Table 4). Data are plotted with 68% confidence 
intervals (using STATISTICA v.7). Samples that produced aquatic biomarkers 
are marked with asterisks. c, Photograph of a reconstructed Incipient Jomon 
vessel from Torihama. 
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Table 1 | Summary of GCMS results of charred deposits on Incipient Jamon pottery 


Site Region Phase Associated dates* No. samples No. samples No. samples yielding 
(103 cal BP) analysed yielding lipidt aquatic biomarkers 

Kubodera minami Central Honshu ll 15.3-14.2 4 ) 0 

Taisho 3 Hokkaido ll 15.0-14.0 2 2 1 

Seiko SansoB Central Honshu Il 14.1-13.9 1 1 0) 

Sankakuyama | Kyushu ll 13.6-13.4 13 0 0 

Torihama Western Honshu Ul 12.1-11.8 35 32 17 

Muroya Cave Central Honshu Ul NA 1 0 0 

Tazawa Central Honshu Ul NA 1 0) 0 


Further details are available in Supplementary Table 2. 
*Range corresponds to +2s.d. 

+Greater than 0.1 jg mg? of food crust. 

NA, not available. 


environment because the -(o-alkylphenyl) alkanoic acids are formed 
only at high temperatures (>270 °C) from the protracted heating of 
polyunsaturated fatty acids*®. The presence of one of these biomarkers 
in two exterior deposits (Supplementary Table 2) is most probably 
attributable to overspill of the vessel’s contents. 

Further information regarding the source of the organic residues 
was obtained by measuring the stable carbon isotope ratio of two medium- 
chain-length saturated fatty acids (Cy6.9 and Cig.9) preserved in sufficient 
quantities in the charred surface deposits adhering to 12 of the Torihama 
vessels and two of those from Taisho 3. Eleven samples, including nine 
with characteristic aquatic biomarkers, yielded either Cy¢.9 or Cig.o 
fatty acids noticeably enriched in ‘°C (that is, 5'°C = —25%o). These 
5'°C values are more consistent with authentic marine products than 
with freshwater or terrestrial animals (Fig. 2b). Two samples from 
Torihama, both with aquatic biomarkers, yield 8C values closer to 
those of lipid extracts from modern pottery vessels used to prepare 
freshwater as well as non-ruminant animals from Japan (Fig. 2b), 
but their use cannot be resolved more specifically. In contrast, one 
Torihama sample (T66), which was depleted in '°N (5'°N = 6.8%), 
contained fatty acids with an isotope composition comparable to wild 
ruminant fats*’ (Fig. 2b). Aquatic biomarkers were absent from this 
sample and the lipid profile was instead dominated by saturated fatty 
acids (C,4—-C;g) that are more typical of degraded terrestrial animal fat, 
indicating a very different use history of this particular pottery vessel. 

Despite some variation in pottery use at Torihama, the majority of 
the samples, and all but one with aquatic biomarkers, had relatively 
high bulk 513C (>—26%o) and 5'°N (>9%p) values, the latter consis- 
tent with charred deposits formed experimentally from authentic 
marine organisms (Supplementary Table 3) and both comparable to 
those associated with Northern European hunter-gatherer pottery 
found at coastal sites’*. Both the bulk and the compound-specific iso- 
tope results from the earlier site of Taisho 3 (~15,000-14,000 cal Bp) 
are also consistent with the processing of marine products in pottery 
vessels (Fig. 2). Therefore, we suggest that the preparation of aquatic 
resources, particularly marine foodstuffs, was the dominant mode of 
pottery use at these sites even though neither site is situated directly on 
the coast (Fig. la). At Torihama, the nearby exposed littoral shelf 
conceivably provided easy access to a range of marine mammals and 
fish during the Late Pleistocene, even considering the slightly lower sea 
level during that period” (Fig. 1a). An alternative explanation is that 
migratory fish, such as salmon, were caught much closer to each site as 
they travelled upriver from marine feeding grounds. 

More generally, considering the 13 Incipient Jomon sites, the dis- 
tribution of 5'°N values in charred deposits associated with pots that 
date from the warmer Bolling—Allerod interstadial, when there was 
greater deciduous forest cover” and larger numbers of large mammals, is 
not significantly different (Mann-Whitney Utest) from that of the 
Younger Dryas stadial for sites on Honshu (U= 392, P=0.21, 
NBolling-Allerod — 14, NY ounger Dryas — 71) or overall (U = 893, P= 0.20, 
NBolling-Allerod = 30, Nyounger Dryas = 71). From both the bulk stable 
isotope data and the more specific product identifications based on 
available lipid data, we suggest that aquatic products were the most 
frequently processed products in Incipient Jomon pottery, through the 


fluctuating climate and across a range of environments, from Hokkaido 
in the north to Kyushu in the south. Whether ceramic vessels were 
integral to the processing of aquatic resources or, as is suggested by 
their rarity, were used only occasionally, perhaps ceremonially or as a 
prestige technology”, remains debatable. The appearance of pottery at 
Late Pleistocene sites located on narrowing coastal plains and along 
rivers where there was an abundance of terrestrial game, aquatic 
resources and fresh water** might have led to decreased mobility per- 
mitting a greater investment in pottery production for either purpose. 
The long ‘experimental’ phase of pottery use in the late glacial contrasts 
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Figure 3 | Partial gas chromatograms of a lipid extract from a charred 
deposit adhering to a Torihama potsherd (~12,000 cal Bp). a, The total ion 
chromatogram is characteristic of degraded aquatic oil and is dominated by 
medium- and long-chain saturated and mono-unsaturated fatty acids and 
isoprenoid fatty acids, pristanic, phytanic and 4,8,12-TMTD (4,8,12- 
trimethyltridecanoic acid). %,w-dicarboxylic acids (Ml) with carbon chain 
ranges of Cg—C,; were also observed and are the likely oxidation products from 
the complex mixture of long-chain precursor unsaturated fatty acids”, 
derived from freshwater or marine oils, or both. C,,.,, fatty acids with carbon 
length n and number of unsaturations x; br, branched-chain acids; IS, internal 
standard (n-hexatriacontane). b, The summed m/z 91, m/z 105 (mass-to- 
charge ratio) ion chromatogram shows the presence of «-(o-alkylphenyl) 
alkanoic acids with 16 (+), 18 («), 20 (#) or 22 (O) carbon atoms. The presence 
of the third and fourth components, thermally produced from C29 and Cz 
unsaturated fatty acids, and the distribution of their isomers confirm that this 
residue (sample T124) is derived from an aquatic organism”. 
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sharply with its sudden ubiquity in the Early Holocene of Japan 
(Fig. 1b) and other regions of East Asia’, when use intensified and 
became integrated into everyday practice, triggering further long- 
range dispersals of the new technology into other regions®. Although 
it is tempting to link this change in pottery abundance directly to 
resource intensification, increased sedentism and the warmer climate, 
more complex demographic and social factors could also be posited”®. 
Our study, showing that organic residues can be reliably obtained from 
Late Pleistocene pottery, will make it possible to clarify, through fur- 
ther widespread analysis, the drivers of the continued development of 
pottery in the Early Holocene of East Asia and its emergence at other 
innovation centres. 


METHODS SUMMARY 


Subsamples of charred surface deposits (10-30 mg) were removed with a sterile 
scalpel from 101 potsherds from 13 different sites (Supplementary Table 3 and 
Fig. 1a). The chronological phase of each sample was established by its typology, 
stratigraphic position and 4C dating of associated materials (Supplementary 
Table 1). In most cases these agree but, where available, radiocarbon dates of 
materials directly associated with the pottery analysed were favoured to assign a 
chronological phase using the published date ranges’. Lipids were extracted by 
alkali saponification with NaOH in methanol’ (5% (w/v), 2h, 70 °C) or extracted 
and methylated in one-step with acidified methanol”, or both. For the second 
process, methanol (1 ml) was added to homogenized charred deposits (10-30 mg) 
and the mixture was sonicated for 15 min and then acidified with concentrated 
sulphuric acid (200 ul). The acidified suspension was heated in sealed tubes for 4h 
at 70 °C and then cooled, and lipids were extracted with n-hexane (3 X 2 ml) and 
directly analysed by GCMS and GC-C-IRMS using standard protocols'*?**!. 
Saponified extracts were cooled, neutral lipids were removed (n-hexane, 
3 X 2 ml), the extracts were acidified with HCl, and the acid fraction was extracted 
(n-hexane, 3 X 2 ml) and methylated to produce fatty acid methyl esters (FAMEs) 
using BF;-methanol complex (14% (w/v), 200 ul, 1h, 70°C). For GC-C-IRMS, 
instrument precision on repeated measurements was +0.3%bo (s.e.m.) and the 
accuracy determined from FAME and n-alkane isotope standards was +0.5%o 
(s.e.m.). Modern samples and experimental pots were extracted using exactly the 
same procedure or as previously reported”!”’, or both. In addition, crushed surface 
residues (~1mg) were analysed by elemental analysis IRMS as previously 
reported'*. Samples yielding less than 1% N were discarded and instrument 
precision on repeated measurements was +0.2%o (s.e.m.). 5'°C, 8'°N or 880 = 
[(Reampte/Rstandara— L)] X 1,000, where R= °C/"7C, °N/"4N or '80/"8O. The 
standard for 5'°C is Vienna PeeDee Belemnite (V-PDB), the standard for 8'°N 
is air N> and the standard for '*O is Vienna Standard Mean Ocean Water 
(V-SMOW). 
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Slower recovery in space before collapse of 


connected populations 


Lei Dai’, Kirill S. Korolev! & Jeff Gore! 


Slower recovery from perturbations near a tipping point and its 
indirect signatures in fluctuation patterns have been suggested to 
foreshadow catastrophes in a wide variety of systems’*. Recent 
studies of populations in the field and in the laboratory have used 
time-series data to confirm some of the theoretically predicted 
early warning indicators, such as an increase in recovery time or 
in the size and timescale of fluctuations* °. However, the predictive 
power of temporal warning signals is limited by the demand for 
long-term observations. Large-scale spatial data are more access- 
ible, but the performance of warning signals in spatially extended 
systems’ '° needs to be examined empirically***'*. Here we use 
spatially extended yeast populations, an experimental system with 
a fold bifurcation (tipping point)®, to evaluate early warning sig- 
nals based on spatio-temporal fluctuations and to identify a novel 
spatial warning indicator. We found that two leading indicators 
based on fluctuations increased before collapse of connected popu- 
lations; however, the magnitudes of the increases were smaller than 
those observed in isolated populations, possibly because local vari- 
ation is reduced by dispersal. Furthermore, we propose a generic 
indicator based on deterministic spatial patterns, which we call 
‘recovery length’. As the spatial counterpart of recovery time’, 
recovery length is the distance necessary for connected populations 
to recover from spatial perturbations. In our experiments, recovery 
length increased substantially before population collapse, suggest- 
ing that the spatial scale of recovery can provide a superior warning 
signal before tipping points in spatially extended systems. 
Positive feedback is widespread in nature, ranging from cellular 
circuits to population growth to the melting of ice sheets. There is 
growing evidence that positive feedback leads to alternative stable 
states and tipping points in various ecological systems'*""*. Closer to 
a tipping point an ecosystem becomes less resilient and more likely to 
shift to an alternative state’? such as the collapse of fish stocks, eutro- 
phication of lakes, or loss of vegetation’’. Predicting these undesirable 
transitions may sound like an impossible task because of the inherent 
complexity underlying these systems. However, recent advances incor- 
porating ideas from nonlinear dynamical systems theory suggest that 
there may be signatures of “critical slowing down” in the vicinity of 
tipping points’’. At the brink of these sudden transitions, the recovery 
of a system after perturbations should slow down”, also leading to 
changes in the pattern of fluctuations”’. Thus, a set of indicators related 
to critical slowing down may provide advance warning of an impend- 
ing transition. Empirical tests in the field* and in the laboratory*** 
have revealed some of the early warning signals based on fluctuations 
in time series, such as temporal variation and autocorrelation. 
However, our understanding of early warning signals in spatially 
extended systems is still limited'*. The studies in time series typically 
ignore spatial interactions; in reality spatial coupling between habitat 
patches (for example, dispersal of populations or exchange of biomass) 
is common and may affect the performance of some warning signals”. 
Moreover, temporal warning signals rely on data from long-term obser- 
vations, which are scarce and difficult to obtain. Large-scale spatial data, 
such as satellite-derived data sets'’, could be more readily available. 


Spatial data not only provide a greater quantity of information, but 
also allow us to study features of the system that are not available 
through time series. Statistical indicators based on spatial fluctuations 
have been proposed’’° but empirical studies are limited*’""’; tests of 
these indicators in replicated experiments, which avoid the bias intro- 
duced by selective sampling”, are lacking. In addition, previous studies 
of vegetation systems discovered emerging spatial patterns preceding 
transitions***. However, the vegetation patterns are often specific to 
the system studied, whereas identifying generic spatial warning signals 
would add a powerful tool to the analysis of ecosystem stability. Here 
we address these questions using an experimental system of spatially 
extended yeast populations with alternative stable states and a tipping 
point leading to population collapse. 

We grew laboratory populations of the budding yeast Saccharomyces 
cerevisiae in sucrose and performed daily dilution into fresh media. 
During the daily dilution, a fraction (for example, 1 in 500 for a dilution 
factor of 500) of the cells were transferred to fresh media. This is a well 
characterized system with an experimentally mapped fold bifurcation 
(tipping point)*. Yeast cells grow cooperatively in sucrose by sharing the 
hydrolysis products”, creating a positive feedback between cells that 
leads to bistability and a tipping point (Supplementary Fig. 1). By 
increasing the dilution factor (equivalent to an increase in the mortality 
rate), we could drive isolated yeast populations to collapse on crossing 
the tipping point (Fig. 1a). 

We then connected local yeast populations spatially through con- 
trolled dispersal between nearest neighbours on a one-dimensional 
array (Fig. 1b). Spatial coupling between local populations was intro- 
duced by adding a dispersal step during the daily dilution. In the dis- 
persal step, 25% (corresponding to a dispersal rate D = 2 X 25% = 0.5) 
of a local population was transferred to each of its nearest neighbours; 
the rest of the population remained in the patch. For each dilution 
factor, there were four replicate arrays each consisting of ten patches. 
A group of isolated populations (D = 0) was grown in a similar experi- 
mental setting except that there was no mixing between neighbours 
(Methods). The isolated populations served as a control group and 
allowed us to investigate the effects of spatial coupling on warning 
signals. From dilution factor 500 to 1,600, both groups of connected 
and isolated populations survived and reached equilibrium densities in 
a week; at dilution factor 1,700, most of the populations collapsed 
within the timescale of our experiment (insets to Fig. 1a). 

After the populations stabilized, we tracked the fluctuations of popu- 
lation density around equilibrium for at least five days to calculate 
statistical indicators (Methods). Consistent with critical slowing down, 
we observed a clear increase in the coefficient of variation (CV) of con- 
nected populations towards the tipping point (Fig. 2a); however, the 
magnitude of increase in CV was smaller than in the isolated popula- 
tions. We then tested lag-1 autocorrelation, a leading indicator for the 
temporal correlation of fluctuations. As expected, we found that the 
temporal correlation of connected populations increased gradually to 
around 0.6 in the vicinity of the tipping point (Fig. 2b). Similar to the 
observation in CV, the signal in temporal correlation was weaker than 
in the isolated populations. Although fluctuations of population density 
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Figure 1 | Yeast populations with a tipping point: an experimental system to 
study the collapse of connected populations. a, Isolated yeast populations 
collapse after crossing a tipping point. The distribution of population density 
around equilibrium is shown in spread points; the red square denotes the mean. 
Insets are traces of replicate populations at dilution factor 1,000 (stable) and 
1,700 (collapsed). b, Yeast populations are spatially connected by controlled 
daily dispersal. Each circle corresponds to a habitat patch where a local 
population grows. A fraction of the local population is transferred to each of its 
two nearest neighbours, and the rest to itself. 


in general became larger and more correlated before population col- 
lapse, we found that these two warning signals (CV and lag-1 autocor- 
relation) seemed to be suppressed in the presence of dispersal, especially 
at higher dilution factors. 

One explanation for the observed suppression of the two leading 
indicators in connected populations is that flows between neighbours 
smooth out the fluctuations across different patches and effectively 
reduce the autocorrelation in any local population. Reduced size or 
timescale of fluctuations due to dispersal among populations was pre- 
dicted in previous theoretical studies of spatially explicit ecological 
models*’°**?”, We note that the smaller warning signals of connected 
populations in our experiment may be partly due to a minor shift in the 
tipping point (Supplementary Fig. 2). The averaging effect of dispersal 
was also found in an independent group of populations subject to 
“100% dispersal treatment’, in which we mixed ten populations com- 
pletely each day during the dispersal step. In this extreme scenario, the 
populations showed almost no increase in variation before the tipping 
point (Supplementary Fig. 3). Moreover, we demonstrated the sup- 
pression of CV and lag-1 autocorrelation by dispersal in analytical 
derivations based on a spatially explicit first-order autoregressive 
model (Supplementary Note 1) and in stochastic simulations using a 
phenomenological model of yeast growth® (Supplementary Fig. 4). 
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Figure 2 | Early warning signals based on fluctuations show suppressed 
increase in connected populations. a, Coefficient of variation (CV). 

b, Temporal correlation (lag-1 autocorrelation). The coefficient of variation 
and temporal correlation of both isolated populations (red squares) and 
connected populations (blue circles) increased before the tipping point. The 
signals were suppressed in the connected populations, possibly owing to the 
averaging effect of dispersal. Error bars are standard errors given by bootstrap 
for isolated populations and standard errors of the mean (n = 4) for connected 
populations. 


Spatial coupling introduces the possibility of another warning indi- 
cator based on spatial fluctuations: spatial correlation. Long-range 
spatial correlation has been known to occur in the vicinity of some 
phase transitions**; recent theoretical work in spatially explicit eco- 
logical models found that increasing spatial correlation could be a 
warning signal before transitions to an alternative stable state*. We 
tested the two-point correlation between nearest neighbours in the 
connected populations but failed to observe any increase near the 
tipping point (Supplementary Fig. 5). Simulation results with varying 
sample size showed that no statistically significant increase in spatial 
correlation should be discerned with the limited samples in our experi- 
ment. Thus, our results suggest that to observe the increase in spatial 
correlation may require more data than for other indicators. 

Facing the potential difficulty of observing a strong warning signal 
based on fluctuations in spatially connected populations, we set out to 
look for possible new indicators. The existing warning signals can be 
classified into different categories, based on the nature of the perturba- 
tions and measurements (see Fig. 3). Measuring the recovery time after 
a pulse perturbation (Fig. 3a) can provide a robust indicator of the 
distance to a tipping point’. In large complex systems, it is often 
impractical to perform such temporal perturbations repeatedly and 
measure recovery time. However, owing to stochastic perturbations 
such as demographic noise, population density constantly fluctuates 
around the equilibrium. Changes of fluctuation patterns such as an 
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Figure 3 | Early warning signals can be classified into four categories by the 
nature of perturbations and measurements. a, Recovery time; b, Recovery 
length; c, Statistical indicators based on temporal fluctuations; d, Statistical 
indicators based on spatial fluctuations. The unexplored category of early 
warning signals is the spatial counterpart of recovery time: ‘recovery length’. 
The recovery length characterizes the spatial scale over which population 
density recovers from a pulse perturbation in space, such as at a boundary with 
a region of lower quality (b). The recovery length increases towards the tipping 
point (Supplementary Note 2) and provides a novel indicator of critical slowing 
down in spatial data. 


increase in variation and correlation (Fig. 3c, d), measured either in 
time or in space, are also signatures of critical slowing down and 
consist of another two categories of leading warning signals**°'°7". 
Surprisingly, there is one remaining category that has not been pro- 
posed: find or create a ‘pulse perturbation in space’ (Fig. 3b) and 
measure the spatial counterpart of recovery time’. Adjacent to a 
region of poor quality, the neighbouring good patches will not imme- 
diately have reached their carrying capacity; instead the carrying capa- 
city will be reached only further from the bad region (Supplementary 
Fig. 6). Rather than an increase in the timescale to recover, critical 
slowing down here manifests itself as an increase in the spatial scale to 
recover (Supplementary Note 2), that is, an increase in ‘recovery 
length’ as compared to ‘recovery time’. 
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To test our ‘recovery length’ hypothesis, we performed another set 
of experiments with spatially connected yeast populations (dispersal 
rate D = 0.5), now with two different regions: a relatively good (lower 
dilution factor) region of five patches and a bad (high dilution factor) 
region of one patch (Fig. 4a). Given this sharp boundary between two 
regions of different quality, population density in the good region 
recovered gradually in space to the equilibrium value. As the condition 
of the good region deteriorates, we expect an increase in the spatial 
scale over which the populations recover. Indeed, we observed a clear 
change in the steady-state recovery profile of populations with increas- 
ing dilution factor of the good region (Fig. 4b). In agreement with our 
hypothesis, the spatial recovery spanned a much longer distance closer 
to the tipping point. 

We quantified this spatial scale using two different indicators 
(Fig. 4c). The first indicator, the ‘half-point recovery length’, measures 
the distance between the bad region and the location of half recovery 
(Methods). The half-point recovery length increased gradually with 
dilution factor from less than 0.5 to around 2. The second indicator, 
the ‘exponential recovery length’, is obtained by fitting the recovery 
profile with an exponential function (Methods). Similar to the first 
indicator, the exponential recovery length increased more than three- 
fold as the tipping point was approached. Thus, both measures suggest 
that the recovery length provides a strong warning signal before popu- 
lation collapse in our system. We also observed an increase in both 
indicators as we slowly caused the good region to deteriorate by 
increasing the dilution factor and induced the collapse of connected 
populations in real time (Supplementary Fig. 7). 

Recovery length completes the four categories of early warning sig- 
nals and can help improve our assessment of spatially extended systems. 
Our results suggest that stronger spatial coupling (higher dispersal rate) 
suppresses early warning signals in CV and temporal correlation 
(Supplementary Fig. 4). In contrast, the magnitude of recovery length 
increases with the level of spatial coupling (Supplementary Note 2). 
These two categories of early warning signals are therefore comple- 
mentary: when one signal is weak the other is strong. Also, although 
our experiment was conducted on a linear array, the use of recovery 
length can be readily generalized to two-dimensional systems by map- 
ping the profile perpendicularly to contours of population density. 
Unlike the specific spatial patterns found in two-dimensional vegeta- 
tion systems**”, recovery length may provide a generic measure given 
that the spatially coupled units by themselves would recover more 
slowly near the tipping point. 

Finally, from a practical perspective, boundaries between regions 
of different quality are ubiquitous in nature, thus providing many 
opportunities to measure the recovery length in populations of inter- 
est. One specific example of recovery length would be the “distance of 
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Figure 4 | Recovery length provides a direct measure of critical slowing 
down in space. a, Connected populations in relatively good regions of dilution 
factor 750 (left) and 1,200 (right) recover gradually in space to the equilibrium 
density. Blue circles denote the steady-state profile of population density after 
averaging over replicates (shown in grey). b, Recovery profiles at dilution factor 
500, 750, 1,000, 1,200, 1,350 and 1,400 show an increasing spatial scale of 
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recovery. The profile is normalized by the population density of the patch 
furthest from the bad region. Lines are shape-preserving interpolations; the 
position of half-recovery is marked bya red square. c, Two different measures of 
recovery length increase substantially with dilution factor. Error bars are 
standard errors given by bootstrap. 
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edge influence” in landscape ecology”: it quantifies the spatial scale of 
edge influence on biota in fragmented landscapes. Data of edge influ- 
ence for forests at different sites suggest a longer recovery length in the 
Australian tropical forests*®, which seems to be borne out by the recent 
forest collapses in Western Australia. In principle, the recovery length 
can also be measured when spatially extended populations ‘recover’ 
from a region of higher quality (Supplementary Figs 6 and 8), suggest- 
ing that boundaries may be introduced by conservation efforts (for 
example, setting up marine reserves). 

Our experiments were performed in the simplest spatial setting pos- 
sible: homogeneous environments and dispersal rates, a large popu- 
lation size and a safe distance away from the tipping point. In the 
presence of environmental heterogeneity, measurement of recovery 
length may fail if the desired sharp boundary between regions of dif- 
ferent quality is blurred. However, in such a case we would expect 
enhanced signals in spatial correlation® and spatial variation before 
population collapse (Supplementary Fig. 9). Our experiments have also 
not explored the effects of spatial coupling on the global stability of a 
meta-population. On the one hand, spatial coupling may reduce fluc- 
tuations and the probability that a random shock will trigger a state 
shift?’; on the other hand, stochastic local extinctions or the introduc- 
tion of a bad region may drive the connected populations to collapse 
before the tipping point of a local population (Supplementary Note 3). 

Our work illustrates the important role of spatial coupling, such as 
the dispersal of populations, in our understanding of how to apply the 
current toolbox of warning indicators to natural populations. More 
empirical studies are required to confirm the generality and applic- 
ability of different indicators, but being able to observe warning signals 
in connected populations suggests that we may be able to develop 
quantitative metrics for assessing the fragility of spatially extended 
complex systems. 


METHODS SUMMARY 


We grew the budding yeast Saccharomyces cerevisiae in 200 ll batch culture on BD 
Falcon 96-well Microtest plates at 30.7°C (+0.2°C, standard deviation) using 
synthetic media supplemented with 2% sucrose’. Serial dilutions were performed 
daily with variable dilution factors. Population densities were recorded each day 
before the serial dilution by measuring optical density at 600 nm. Statistical indi- 
cators were calculated after the populations stabilized. The coefficient of variation 
was calculated as the sample standard deviation divided by the sample mean. The 
temporal correlation was estimated by the Pearson’s correlation coefficient 
between the population densities at subsequent days. In the experiment to measure 
recovery length, the half-point recovery length was estimated by performing a 
shape-preserving interpolation to the steady-state recovery profile of population 
density and then locating the position of half-recovery. The exponential recovery 
length was estimated by fitting an exponential function to the recovery profile. For 
further details, see the online-only Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Experimental protocols. We grew the budding yeast Saccharomyces cerevisiae in 
200 pl batch culture on BD Falcon 96-well Microtest plates at 30.7 °C (+0.2 °C, 
standard deviation) using synthetic media (yeast nitrogen bases + nitrogen, 
Complete Supplement Mixture) supplemented with 2% sucrose®. Cultures were 
maintained in a well-mixed condition by growing on a shaker at 825 r.p.m. Serial 
dilutions were performed daily (23h of growth) with variable dilution factors. 
Population densities were recorded each day before the serial dilution by mea- 
suring optical density at 600nm using a Thermo Scientific Varioskan Flash 
Multimode Reader. The calibration between optical density and cell density was 
based on the previous characterization of this system’. 

In the group of connected populations, for each dilution factor there were four 

replicate arrays each consisting of ten patches. Populations were connected by 
controlled dispersal between nearest neighbours (dispersal rate D = 0.5, which is 
defined as the fraction of population going out of a patch). Reflecting boundary 
conditions were adopted, meaning that a population on the edge would have 75% 
of its cells remaining in the patch during the dispersal step. In the group of isolated 
populations, the experiment was performed in a similar spatial setting except that 
there was no dispersal (D = 0); for each dilution factor there were four arrays each 
consisting of five patches, giving a total of 20 replicate populations isolated from 
each other. The dilution factors for the data presented in Fig. 2 are 500, 1,000, 
1,200, 1,400 and 1,600. In the experiment to measure recovery length, populations 
were connected by nearest-neighbour dispersal (D = 0.5, reflecting boundary con- 
ditions). The dilution factor for the bad region (one patch) was 2,500; the dilution 
factor for the good region (five patches) was varied as the environmental driver. 
The dilution factors for the data presented in Fig. 4 are 500, 750, 1,000, 1,133, 
1,200, 1,266, 1,350 and 1,400. 
Calculation of statistical indicators. Statistical indicators for the connected 
populations were calculated among ten populations in one array on each day 
and averaged over a span of at least five days, after the populations stabilized. 
The mean value of four replicate arrays and the standard error of the mean ( = 4) 
are shown in Fig. 2. For the isolated populations, statistical indicators were calcu- 
lated on each day among 20 populations over five days. We used bootstrap to 
compute the standard errors of the indicators by resampling 1,000 times the 
ensemble of replicate populations (for the coefficient of variation and the temporal 
correlation) or arrays (for the spatial correlation). 

The coefficient of variation (CV) was calculated as the sample standard devi- 
ation (Supplementary Fig. 3c) divided by the sample mean. Because the local 
populations in our experiment were grown in a homogeneous environment, in 
principle they could all be treated as replicates. Assuming the system is ergodic, the 
CV calculated over an ensemble of replicates can be interpreted either as the spatial 
CV of many populations at one time point or the temporal CV of a single popu- 
lation over many time points. The temporal correlation, defined as the lag-1 
autocorrelation, was estimated by the Pearson’s correlation coefficient between 
the population densities at subsequent days. To correct for negative bias in small 
samples, we used a modified estimator with an additional term 1/N for lag-1 
autocorrelation*'. The sample size N=10 for connected populations and 
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N = 20 for isolated populations. N is a fixed number for different dilution factors, 
so using the modified estimators would not affect the trend of indicators. The 
spatial correlation, defined as the two-point correlation between all neighbouring 
pairs, was estimated by the Moran’s coefficient***. The expectation of Moran’s 
coefficient is —1/(N- 1) in the absence of spatial correlation*’; we used a modified 
estimator with an additional term 1/(N - 1) so that the expectation is 0. In this case, 
the sample size N is the number of patches in an array: N= 10 for connected 
populations and N=5 for isolated populations. For detailed formulae of the 
statistical indicators, see Supplementary Note 4. 

In the analysis we ensured environmental homogeneity by removing the linear 

gradient of population density observed in connected populations. This small 
gradient is presumably caused by some heterogeneity in experimental conditions 
(temperature, dilution errors, and so on) across the plate. Removing gradient-type 
spatial heterogeneity before statistical analysis is similar to the detrending proced- 
ure commonly used in time-series analysis; it prevents spurious signals such as 
positive spatial correlation (Supplementary Fig. 10). 
Recovery length. After the recovery profile stabilized, we tracked the population 
density profiles of at least six replicates over several days. The half-point recovery 
length Lyaig was estimated by performing a shape-preserving interpolation (Matlab 
function PCHIP, piecewise cubic Hermite interpolating polynomial) to the recov- 
ery profile and then locating the position of half-recovery, at which 
n(x = Lai) = (1/2)n(x = 5). The population density of the bad region (dilution 
factor 2,500) in our experiment was close to 0 (Fig. 4 and Supplementary Fig. 7). In 
the more general scenario with a sharp boundary between two regions of different 
quality (Supplementary Figs 6 and 8), the position of half-recovery can be defined 
as the midpoint between the equilibrium population density of the region of 
interest and the population density at the boundary. 

The exponential recovery length L,,, was estimated by fitting an exponential 
function with three parameters cjexp(—x/Lexp) + c2 to the recovery profile n(x). 
The data points used for exponential fitting are from positions 1 to 5 (except for 
dilution factor 500, the data for fitting are from positions 0 to 5). We note that our 
definition of exponential recovery length is phenomenological, because: (1) the 
deviation is expected to be exponential only close enough to the equilibrium; 
(2) at higher dilution factors the profile can deviate from an exponential form 
(Supplementary Fig. 11). The ‘kink’ in the fitted exponential recovery length 
(Fig. 4c) may be due to the limited data points used in fitting or experimental 
errors. For both the half-point recovery length and the exponential recovery 
length, we used bootstrap to compute standard errors for the indicators by resam- 
pling the ensemble of steady-state profiles 100 times and fitting the average recov- 
ery profile. 
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Rescuing cocaine-induced prefrontal cortex 
hypoactivity prevents compulsive cocaine seeking 


Billy T. Chen!, Hau-Jie Yau!, Christina Hatch!, Ikue Kusumoto- Yoshida‘, Saemi L. Cho’, F. Woodward Hopf? 


& Antonello Bonci!** 


Loss of control over harmful drug seeking is one of the most intract- 
able aspects of addiction, as human substance abusers continue to 
pursue drugs despite incurring significant negative consequences’. 
Human studies have suggested that deficits in prefrontal cortical 
function and consequential loss of inhibitory control’* could be 
crucial in promoting compulsive drug use. However, it remains 
unknown whether chronic drug use compromises cortical activity 
and, equally important, whether this deficit promotes compulsive 
cocaine seeking. Here we use a rat model of compulsive drug seek- 
ing** in which cocaine seeking persists in a subgroup of rats despite 
delivery of noxious foot shocks. We show that prolonged cocaine 
self-administration decreases ex vivo intrinsic excitability of deep- 
layer pyramidal neurons in the prelimbic cortex, which was sig- 
nificantly more pronounced in compulsive drug-seeking animals. 
Furthermore, compensating for hypoactive prelimbic cortex neurons 
with in vivo optogenetic prelimbic cortex stimulation significantly 
prevented compulsive cocaine seeking, whereas optogenetic prelimbic 
cortex inhibition significantly increased compulsive cocaine seeking. 
Our results show a marked reduction in prelimbic cortex excitability 
in compulsive cocaine-seeking rats, and that in vivo optogenetic pre- 
limbic cortex stimulation decreased compulsive drug-seeking beha- 
viours. Thus, targeted stimulation of the prefrontal cortex could serve 
as a promising therapy for treating compulsive drug use. 

To investigate the neurobiological dysfunctions that underlie com- 
pulsive drug use, we used an animal model of addiction previously 
shown to engender a subpopulation of rats with compulsive cocaine 
seeking. Rats underwent extended cocaine self-administration training 
(Fig. 1a) under a heterogeneous seeking-taking chain schedule® (details 
in the Supplementary Information). Daily self-administration sessions 
consisted of 30 trials that began with insertion of the seek lever. After 
one lever press and a variable interval (5 to 75 s), one additional press 
on the seek lever resulted in seek lever retraction and take lever 
insertion. One take lever press triggered a single intravenous cocaine 
infusion and retraction of the take lever. After >8 weeks of self- 
administration training, rats received four days of shock sessions in 
which 30% of the seek link ended with delivery of a noxious foot shock 
(0.4 mA, 0.5s). Rats earning = 10 cocaine infusions (14 of 46 rats, 30%, 
24.1 + 1.9 infusions) in the fourth shock session were considered 
shock-resistant, whereas rats earning <10 infusions (32 of 46 rats, 
70%, 1.3 + 0.3 infusions) were considered shock-sensitive (Supplemen- 
tary Fig. 1). Previous works have observed a relatively similar propor- 
tion of shock-resistant rats** and suggested that these rats exhibit 
greater basal impulsivity but not higher reactivity to novelty®. Seek lever 
presses (Fig. 1b, c) and earned cocaine infusions (Fig. 1d, e) were sig- 
nificantly inhibited by foot shocks in shock-sensitive rats, whereas 
shock-resistant rats were much less affected (Fig. 1b-e). Baseline 
cocaine-seeking behaviours and earned cocaine infusions were similar 
in both groups (Fig. 1c, e), suggesting that divergent responses to con- 
tingent foot shocks did not reflect cocaine self-administration history’. 


Clinical evidence implicates prefrontal cortex (PFC) hypofunction”* 
in the loss of inhibitory control over drug seeking, but a causal link 
between drug-induced neuroadaptations and compulsive seeking beha- 
viour remains elusive. We focused on the prelimbic cortex, because a 
homologous region in the human PFC?"® regulates many cognitive 
functions, including decision making and inhibitory response con- 
trol’’’°, which are compromised in human addicts’. We propose that 
chronic cocaine use induces prelimbic cortex hypoactivity, and that 
compromised prelimbic cortex function in turn impairs inhibitory 
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Figure 1 | Prolonged cocaine self-administration induces compulsive 
cocaine seeking in a subpopulation of rats. a, Experimental timeline showing 
the seek-take task and long-access sessions before the beginning of the foot 
shock sessions. After approximately two months of cocaine self-administration, 
rats received four days of shock sessions, in which a foot shock was 
administered in 30% of trials. b, Foot shocks decreased seek-lever presses in 
shock-sensitive rats (n = 32) but not shock-resistant rats (n = 14). c, Seek-lever 
presses on last day of baseline and last day of shock (arrowheads in b) 

(Fa,6s) = 27.89, P< 0.001; Tukey’s post-hoc: ***P < 0.001 baseline versus 
shock sessions; +P < 0.001 shock-sensitive versus shock-resistant shock 
session). d, Foot shocks greatly reduced cocaine infusions in shock-sensitive 
rats. e, Earned cocaine infusions (inf) on last day of baseline and last day of 
shock (F(3,68) = 246.8, P< 0.001; Tukey’s post-hoc: ***P < 0.001 baseline 
versus shock session; +P < 0.001 shock-sensitive versus shock-resistant shock 
sessions). NS, not significant. Error bars indicate s.e.m. 
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control over compulsive drug seeking. Thus, we performed ex vivo 
electrophysiology to determine whether cocaine self-administration 
reduced prelimbic cortex intrinsic excitability, because action potentials 
are the main mechanism of neuronal communication and altered spike 
firing contributes to neuronal dysfunction’® and central nervous system 
disorders’’. 

Intrinsic excitability of deep-layer, morphologically confirmed 
(Fig. 2a)'* pyramidal prelimbic cortex neurons from shock-sensitive 
and shock-resistant rats was evaluated 24h after the fourth shock 
session, and was compared with naive rats. We targeted deep-layer 
pyramidal prelimbic cortex neurons because they project to brain 
structures implicated in drug-seeking behaviours, including the nuc- 
leus accumbens, dorsal striatum and amygdala’*”’”°. We first mea- 
sured the amount of current required to induce an action potential in 
response to brief depolarization (2 ms, 0-2,500pA, 10pA steps). 
Relative to naive and shock-sensitive neurons, shock-resistant neurons 
required almost twice as much current to elicit an action potential 
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Figure 2 | Profoundly hypoactive prelimbic cortex neurons in shock- 
resistant rats. a, Morphological features of a recorded prelimbic cortex 
pyramidal neuron. Scale bars represent 0.1 mm and 1 mm in the left and right 
images, respectively. b, Shock-resistant prelimbic cortex neurons required 
more current to elicit firing with a 2-ms current step. Top, examples of sub- 
threshold membrane depolarization and the first action potential elicited by 
current injection. Square pulses illustrate increasing current injection. Bottom, 
current required for firing was higher in shock-resistant rats (F(3,45) = 18.53, 
P<0.001; Tukey’s post-hoc: ***P < 0,001 naive or shock-sensitive versus 
shock-resistant) (naive, n = 10 cells; shock-sensitive, n = 13 cells; shock- 
resistant, n = 16 cells). c, Top, neuronal responses to hyperpolarizing steps. 
Bottom, input resistance was decreased only in shock-resistant rats 

(FG,60) = 3.986, P = 0.011; Tukey’s post-hoc: +P < 0.01 naive versus shock- 
resistant, **P < 0.01 shock-sensitive versus shock-resistant) (naive, n = 12 
cells; shock-sensitive, n = 24 cells; shock-resistant, n = 16 cells). d, Prelimbic 
cortex firing in response to current injection. e, Firing frequency versus current 
injection (1s duration, 0 to 500 pA, 50 pA steps) (two-way analysis of variance 
(ANOVA) among all groups: treatment, F(3,59, = 17.45, P< 0.001; interaction, 
F(30,590) = 11.95, P< 0.001; two-way ANOVA between shock-resistant and 
shock-sensitive: treatment, Fa, 33) = 6.440, *P = 0.016; interaction group X 
input, F(10,330) = 6.201, P< 0.001) (naive, n = 15 cells; shock-sensitive, n = 19 
cells; shock-resistant, n = 16 cells). Statistical analyses include neurons from 
rats in Supplementary Fig. 5. Error bars indicate s.e.m. 
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(Fig. 2b) and exhibited significantly smaller input resistance 
(Fig. 2c). Thus, the ability of prelimbic cortex neurons to rapidly 
generate an action potential was compromised in shock-resistant rats. 
In addition, prelimbic cortex neurons from shock-resistant and shock- 
sensitive rats showed impairments in repetitive firing induced by 
a more moderate, sustained depolarizing currents (1s, 0-500 pA, 
50 pA steps), with a significantly greater impairment in shock-resistant 
versus shock-sensitive rats (Fig. 2d, e). Also, cocaine intake under 
shock correlated with decreased input resistance and increased current 
required for firing across all rats tested (Supplementary Fig. 2). 
Analysis of the action potential shape during whole-cell recordings 
and nucleated patch electrophysiological recordings revealed no dif- 
ferences in Na’ channel activation, or in the composition of somatic 
K™ currents or barium-sensitive KY conductances (Supplementary 
Table 1 and Supplementary Figs 3 and 4) among naive, shock-sensitive 
and shock-resistant rats. Control experiments in which shock- 
sensitive rats self-administered additional cocaine after each shock 
session suggested that the quantity of cocaine per se was not respon- 
sible for firing adaptations in shock-resistant rats (Supplementary 
Fig. 5). 

Thus, our results demonstrate that long-term cocaine self- 
administration reduced prelimbic cortex excitability, with a much 
more pronounced effect in compulsive rats. If compromised spike 
firing was causally related to compulsive cocaine seeking, then increas- 
ing prelimbic cortex neuronal activity could decrease compulsive 
behaviour. One way to compensate for reduced excitability is to take 
advantage of in vivo optogenetic stimulation*'”’, in which prelimbic 
cortex neurons can be activated in a temporally precise manner during 
cocaine self-administration trials. Deep-layer prelimbic cortex neurons 
expressing channelrhodopsin-2 (ChR2) (Supplementary Fig. 6a) 
showed minimal loss of spike fidelity at photostimulation frequencies 
up to 20 Hz both ex vivo and in vivo (Supplementary Fig. 6b, c). Also, 
1 Hz prelimbic cortex photoactivation evoked field potentials in the 
nucleus accumbens core, a downstream prelimbic cortex target”*, with- 
out loss of fidelity or generation of long-term depression (LTD) of 
glutamatergic function (Supplementary Fig. 7), although other frequen- 
cies can induce prelimbic cortex LTD**. Thus, photoactivation of 
ChR2-expressing prelimbic cortex neurons can be reliably achieved at 
physiologically relevant timescales. 

To test whether enhancing activity in a hypoactive prelimbic cortex 
would decrease compulsive cocaine seeking, adeno-associated virus 
(AAV) encoding ChR2 fused to enhanced yellow fluorescent protein 
(ChR2-eYFP) was bilaterally injected into the prelimbic cortex, with 
bilateral implantation of chronic optic fibres” targeting the prelimbic 
cortex (Supplementary Fig. 8). The effect of prelimbic cortex stimu- 
lation on cocaine seeking was assessed in two separate sessions 
(Fig. 3A), and was only determined in the compulsive cocaine-seeking 
shock-resistant rats. The first prelimbic cortex ChR2 stimulation was 
administered during the last baseline session to determine the effect on 
control cocaine seeking. The second ChR2 stimulation session was 
given 24h after the fourth shock session to determine whether prelim- 
bic cortex activation could alter compulsive cocaine seeking; this session 
was identical to shock sessions except with the addition of photostimu- 
lation. In both ChR2 stimulation sessions, 1 Hz light pulses (10 ms 
duration, 473 nm) were delivered to the prelimbic cortex starting at 
the seek lever extension and terminating at the conclusion of the seek 
link. Prelimbic cortex stimulation had no effect on control, baseline 
cocaine-seeking behaviours (Fig. 3B, a-e). Similarly, in vivo prelimbic 
cortex photoactivation did not promote motivated behavioural 
responding (Supplementary Fig. 9). 

In stark contrast to the baseline ChR2 stimulation session, prelimbic 
cortex photoactivation by ChR2 significantly attenuated foot-shock- 
resistant cocaine seeking. Photostimulation decreased earned cocaine 
infusions and increased the latency to seek-lever press (Fig. 3C, a 
and b), with a trend for decreased seek-lever presses and increased 
inter-press interval (Fig. 3C, c-e). Thus, restoring prelimbic cortex 
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Figure 3 | In vivo optogenetic stimulation of prelimbic cortex suppresses 
compulsive cocaine seeking. A, Timeline illustrating when cocaine-seeking 
behaviours were analysed in shock-resistant rats. B, C, Comparisons were made 
between the last baseline session and ChR2 control session (B), and between 
the fourth shock session and the ChR2 plus shock session 24 h later (C). 

B, a-e, Prelimbic cortex photoactivation did not alter baseline cocaine self- 
administration. C, Light stimulation suppressed compulsive cocaine-seeking 


activity decreased compulsive cocaine seeking only after rats had 
experienced foot shocks. In addition, ChR2-mediated activation of 
prelimbic cortex reduced compulsive responding starting from the 
beginning of the cocaine-intake session (Supplementary Fig. 10), 
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behaviours, with decreased cocaine rewards earned (a, baseline/shock 

Fas) = 6.447, P = 0.052; ChR2/no ChR2: Fi,,5) = 13.998, P = 0.013; 
interaction: F(,s) = 17.699, P = 0.008; Tukey’s post-hoc: *P < 0.05) and increased 
latency to press the seek lever (b, baseline/shock Fi;5) = 5.756, P = 0.062; ChR2/ 
no ChR2: Fy,5) = 13.297, P = 0.015; interaction: Fis) = 30.304, P = 0.003; 
Tukey’s post-hoc: *P < 0.05). A trend was observed in seek-lever presses (c, d) and 
inter-press interval (e). n = 6 rats. Error bars indicate s.e.m. 


suggesting that reduced intake may not require learning or other pro- 
cesses which might develop across the cocaine-intake session. 

On the basis of these results, we proposed that prelimbic cortex 
inhibition might promote compulsive intake in shock-sensitive rats. 
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Figure 4 | In vivo optogenetic inhibition of prelimbic cortex enhances 
compulsive cocaine seeking. A, Timeline illustrating when cocaine-seeking 
behaviours were analysed. B, C, Comparisons were made between the last 
baseline session and eNpHR3.0 control session (B), and between the fourth 
shock session and the eNpHR3.0 plus shock session 24 h later (C). 

B, a-e, Prelimbic cortex photoactivation did not alter baseline cocaine self- 
administration. C, Light stimulation promoted cocaine-seeking behaviours in 
shock-sensitive rats, including increased cocaine rewards earned (a, baseline/ 
shock Fy,3) = 126.99, P = 0.001; eNpHR3.0/no eNpHR3.0: Fi,,3) = 10.038, 
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P= 0.051; interaction: F,,,3) = 12.271, P = 0.039), decreased latency to press 
seek lever (b, baseline/shock Fi; 3) = 118.59, P = 0.002; eNpHR3.0/no 
eNpHR3.0: Fy,,3) = 17.269, P = 0.025; interaction: Fy,3) = 10.751, P = 0.046) 
and decreased inter-press interval (e, baseline/shock Fi;,3) = 124.12, P = 0.002; 
eNpHR3.0/no eNpHR3.0: Fi,,3) = 12.941, P = 0.037; interaction: 

Fu,3) = 10.642, P = 0.047). A trend was observed in seek-lever presses 

(c, d). n = 4 rats. NS, not significant. *P < 0.05 Tukey’s post-hoc. Error bars 
indicate s.e.m. Halo, halorhodopsin (optogenetic inhibitor). 
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Thus, AAV encoding halorhodopsin-eYFP (eNpHR3.0-eYFP)*’” was 
bilaterally injected into the prelimbic cortex, with bilateral chronic optic 
fibres targeting the prelimbic cortex (Supplementary Fig. 11). Prelimbic 
cortex inhibition by eNpHR3.0 was verified ex vivo (Supplementary 
Fig. 12). The effect of prelimbic cortex inhibition was assessed in two 
sessions with identical schedules as in the ChR2 experiments (Fig. 4A), 
except that neurons were inhibited by a continuous light pulse (532 nm) 
during the seek part of the seek-take chain. Prelimbic cortex inhibition 
had no effect on control, baseline cocaine-seeking behaviours (Fig. 4B, 
a-e), but significantly increased cocaine seeking during foot shock ses- 
sions in these shock-sensitive rats, with greater earned cocaine infusions 
(Fig. 4C, a), decreased latency to seek-lever press (Fig. 4C, b), decreased 
inter-press interval (Fig. 4C, e) and a trend for increased seek-lever 
presses (Fig. 4C, c and d). Thus, eNpHR3.0 inhibition of prelimbic 
cortex neurons increased propensity for compulsive cocaine seeking. 
Our results suggest that prolonged cocaine use depressed prelimbic 
cortex excitability, and that, in a select population of rats, a profound 
prelimbic cortex hypoactivity drove compulsive cocaine seeking. We 
speculate that crossing a critical threshold of prelimbic cortex hypoac- 
tivity promotes compulsive behaviours. Notably, prelimbic cortex- 
mediated inhibitory control was only evident after self-administration 
was paired with foot shock, but not during baseline sessions before foot 
shock. In agreement, other groups have proposed that cortical areas 
promote compulsive intake in the presence of a challenge, for example, 
in the face of aversive consequences, in contrast to striatal areas that 
would sustain habitual intake in the absence of such challenges*”. 
Thus, in the absence of conflict, optogenetic manipulation of a brain 
region that mediates inhibitory control would have no effects. 
However, when confronted with the possibility of punishment, pre- 
limbic cortex hypoactivity would bias behaviour towards compulsive 
seeking. Our results also suggest that prelimbic cortex stimulation 
might be clinically efficacious against compulsive seeking, with few 
side effects on non-compulsive reward-related behaviours in addicts; 
whether prelimbic cortex ChR2 stimulation reduces cocaine seeking 
through effects on emotional conditioning or cognitive processing'"* 
remains to be resolved. In conclusion, we provide an effective method 
to decrease compulsive drug seeking in an animal model that may 
closely mimic the human addiction condition, which supports the 
use of in vivo prefrontal cortex stimulation to treat compulsive drug use. 


METHODS SUMMARY 


Cocaine self-administration. Outbred male Wistar (Harlan) rats weighing 275- 
300 g at the beginning of the experiments were used for all experiments. Cocaine 
self-administration began 7 days after surgery. Rats were trained 6 or 7 days per 
week during the animal’s dark cycle. 

Ex vivo patch-clamp recording. Rats were deeply anaesthetized, perfused with 
ice-cold sucrose-substituted buffer, decapitated, and 250-|1m sections of the med- 
ial PFC were prepared. Whole-cell current-clamp recordings and nucleated patch- 
clamp recordings were performed using a potassium-based internal solution”®”’. 
In vivo optogenetic prelimbic cortex stimulation or inhibition during beha- 
viour. Adeno-associated viral vectors (AAV5 CaMKIIo::;ChR2-eYFP or AAV5 
CaMKIIo::eNpHR3.0-eYFP) were packaged by the University of North Carolina 
Vector Core Facility. Following surgery to implant a jugular catheter for intravenous 
cocaine self-administration (see Methods), virus (0.5 pil) was injected bilaterally into 
the prelimbic cortex at 0.1 ,1l min” '. Chronic optical fibres were implanted bilaterally 
directly above the prelimbic cortex. During optogenetic experiments, chronic optical 
implants were connected to optical patch cables coupled to 473-nm (for ChR2) or 
532-nm (for eNpHR3.0) lasers modulated by behavioural hardware (Med Associates). 
Chronic optical implants were constructed on site as previously described”. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Subjects. Adult male Wistar rats were singly housed in a climate-controlled faci- 
lity. Experiments followed the guidelines of the National Institutes of Health Guide 
For the Care and Use of Laboratory Animals, and were performed during the 
animals’ dark cycle. 
Catheter surgery. Rats were implanted with a chronically indwelling intravenous 
catheter, as previously described”. Briefly, rats were anaesthetized with isoflurane 
(2.5%) and a unilateral, intrajugular catheter was inserted 2.7-3.0 cm into the 
jugular vein. Catheters were sutured to the underlying muscle tissue and the other 
end externalized between the scapulas. Catheters consisted of a guide cannula 
(Plastics-One) attached to MicroRenathane tubing (0.025-inch inside diameter, 
0.047-inch outside diameter) and dental cement. The catheter was flushed daily 
with a saline solution containing gentamicin (5 mgml ', Butler Schein) to help 
maintain catheter patency and reduce infection. Following surgery, animals were 
allowed to recover for one week. During the first 5 days of the recovery period, 
animals were housed in home cages and had ad libitum access to food and water. 
Beginning on day 6, rats were food restricted to 20 g per day of standard rat chow. 
Rats were monitored daily to ensure that weight did not fall below 90% of ad 
libitum weight. Food restriction remained until the end of the experiment. Cocaine 
self-administration training began 7 days after surgery. Rats were food restricted to 
promote self-administration behaviour”. 
Stereotaxic surgery. Rats designated for optogenetic experiments received virus 
injection and chronic optic fibre surgery on the same day as catheter surgery. Rats 
were placed into a stereotaxic frame (David Kopf Instruments) and remained under 
isoflurane anaesthesia. Microinjection needles were then inserted bilaterally target- 
ing the prelimbic area (prelimbic cortex; coordinates from Bregma: +3.2 AP, + 1.2 
ML, —3.7 DV, 10° angle). Microinjections were performed using custom-made 
injection needles (26-gauge) connected to a 2-1] Hamilton syringe. Each prelimbic 
cortex hemisphere was injected with 0.5 ul of purified and concentrated AAV 
(~10'* infectious units ml’) encoding ChR2-eYFP or eNpHR3.0-eYFP under 
the control of the Camk2a promoter (AAV5 CaMKIIo::ChR2-eYFP or AAV5 
CaMKIlo::eNpHR3.0-eYFP) to infect prelimbic cortex glutamatergic neurons. 
Injections occurred over 10 min followed by an additional 10 min to allow diffusion 
of viral particles away from the injection site. Chronic optic fibres were implanted 
bilaterally over the injection site (+3.2 anteroposterior, + 1.2 mediolateral, —3.4 
dorsoventral, 10° angle). Fibres were secured to the skull with dental cement. 
General behavioural methods. Behavioural experiments were performed in 
standard operant chambers (Med Associates or Coulbourn Instruments) with 
two retractable levers set 5cm above the floor. A cue light was positioned 6 cm 
above each lever. The two levers flanked a food hopper. One lever was paired with 
cocaine infusion (the take lever) while the other lever was designated as the seek 
lever. Each chamber was also equipped with a house light and tone-generator. All 
sessions continued for 6h or until 30 or 80 cocaine infusions were earned (see 
below for detail methods on seek-take chain schedule), and were performed 6 or 7 
days a week for approximately 2 months. 
Acquisition of cocaine self-administration under the seek-take schedule. Seven 
days after surgery, rats began cocaine self-administration training using the seek- 
take chain schedule. The behavioural model used was modified from the proce- 
dures described previously’. Self-administration training was divided into four 
separate phases: acquisition of the taking response; training of the seek-take chain; 
extended training; and punishment. 
Acquisition of the taking response. In this initial phase of training, each trial 
began with the insertion of the take lever. Depression of the take lever on a fixed 
ratio (FR-1) resulted in the delivery of a single cocaine infusion (0.50 mg kg” | over 
3s). Cocaine infusions were paired with illumination of the cue light (20s) above 
the take lever, retraction of the take lever and extinction of the house light. 
Following a 30s time-out period, another trial was initiated with the insertion 
of the take lever and illumination of the house light. Training sessions terminated 
after either 30 earned cocaine infusions or 6 h, depending on which criterion was 
reached first. Training of the taking response continued until rats were able to earn 
30 cocaine infusions across three consecutive sessions. Typically, training contin- 
ued for five to seven sessions, after which rats advanced to training for the seek- 
take chain schedule. 
Training of the seek-take chain. Each trial in the seek-take chained schedule 
began with insertion of the seek lever (opposite to the take lever), with the take 
lever retracted. A single press on the seek lever resulted in the retraction of the seek 
lever and insertion of the take lever. A single depression of the take lever then 
triggered delivery of a single cocaine infusion, illumination of the cue light above 
the take lever, retraction of the take lever and extinction of the house light. 
Following a 30 s time-out, another trial was initiated with the insertion of the seek 
lever and house light illumination. 

Once the rats were able to earn 30 cocaine infusions, or had less than 20% 
variation in number of cocaine earned across three consecutive days, a random 
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interval (RI) schedule was introduced into the seek link of the chain schedule. 
Under the RI schedule, a trial began with the insertion of the seek lever and 
illumination of the house light. The first seek-lever press initiated the RI schedule. 
Three RI schedules were used across the training period: RI5, RI30 and R160. In the 
RIS schedule, immediately after the first seek-lever press, the software randomly 
selected 0.1, 5s, or 10s as the ‘random interval’. Seek-lever presses during the 
duration of the RI had no programmed consequences. The first seek-lever press 
following the end of the RI (in the case of RI5: 0.1 s, 5 s, or 10s) terminated the seek 
link of the chain and resulted in the retraction of the seek lever and the insertion of 
the take lever. In RI30 schedule, the software could randomly select 15, 30, or 45s, 
whereas for RI60 intervals of 45 s, 60 s or 75 s were used. One press on the take lever 
triggered cocaine infusion, paired with the identical cues as during the training of 
the take response. A time-out period followed each cocaine infusion. The time-out 
period started at 30s but progressively increased to 7 min by the end of training. 
The RI schedule was not altered within any individual training session. Training 
on each RI schedule persisted until the rats were able to earn 30 cocaine infusions, 
or have less than 20% variation across three consecutive days. Once an animal 
achieved stable cocaine seeking behaviour at a RI schedule, usually 4-6 days, it was 
advanced to the next RI schedule. Typically, a rat would spend 5 days in RI5, 
receive another 5 days in R130, and finally finish with another 5 days at RI60. 
Rats were given 10 min to complete each seek-take chain. Failure to complete 
the cycle within the 10min resulted in a lost opportunity to self-administer 
cocaine. A new trial started after the time-out period. More than 90% of rats in 
the present study earned 30 infusions under all three RI schedules. Daily training 
sessions terminated after either 30 earned cocaine infusions or 6h, depending 
upon which criterion was reached first. 

Extended training. After reaching the criterion for R160, rats were given extended 
access to cocaine. In this phase of training, rats were given eight sessions with only 
the take lever available. Rats could earn up to 80 cocaine infusions under an FR-1 
schedule with a post-reinforcement time-out of 30s. Interspersed in between the 
extended cocaine sessions were additional sessions of training on the seek-take 
chain schedule under RI60 schedule. At the conclusion of the extended training 
sessions, all rats received additional training on the seek-take chain schedule under 
RI60 schedule. Stable performance (earning all 30 cocaine infusions or >20% 
variation) across four consecutive days served as baseline for comparison for 
the effects of punishment and in vivo optogenetic stimulation. 

Punishment. During each punishment session, mild foot shocks (0.4 mA, 0.5 s) 
were administered in 30% of the trials. Shocks were delivered after the first lever 
press on the seek lever that occurred after the RI60 period was finished, and before 
extension of the take lever. Each punishment session consisted of 30 trials, and rats 
were given access to the take lever for cocaine infusions in all trials. Rats were given 
four punishment sessions. At the end of four days, shock-sensitive and shock- 
resistant rats were grouped based on the number of earned cocaine rewards. 

In vivo optical excitation sessions with ChR2. Shock-resistant rats received in 
vivo optogenetic excitation of the prelimbic cortex in which light stimulation 
(1 Hz, 10 ms, 10-15 mW, 473 nm)’'”* was administered in two separate sessions. 
The first session was administered one day after last RI60 baseline sessions 
(Fig. 3a). A separate session was administered one day after the fourth shock 
sessions; thus, there was a fifth day of foot shock during which ChR2 was stimu- 
lated using a laser. In each of the two ChR2 stimulation sessions, light stimulation 
was administered only during the seek link of the seek-take chain schedule. 
Insertion of the seek lever initiated activation of the laser. A 1 Hz stimulation 
was present throughout the seek link. Photostimulation was terminated at the 
end of the seek link. 

In vivo optical inhibition sessions with eNpHR3.0. Shock-sensitive rats received 
in vivo optogenetic inhibition of the prelimbic cortex, in which light stimulation 
(continuous, 10-15 mW, 532 nm)” was administered in two separate sessions. 
The first session was administered one day after the last RI60 baseline sessions 
(Fig. 4a). The second session was administered one day after the four shock 
sessions; thus, there was a fifth day of foot shock during which eNpHR3.0 was 
stimulated using a laser. Insertion of the seek lever initiated activation of the laser. 
Laser stimulation was continuously present throughout each seek link of the seek- 
take chain schedule. Photostimulation was terminated at the end of the seek link. 
Drugs. Cocaine HCl, picrotoxin and 6-cyano-7-nitroquinoxaline-2,3-dione dis- 
odium salt hydrate (CNQX) were purchased from Sigma-Aldrich. 

Slice preparation. Rats were euthanized one day after the last shock session. Rats 
were anaesthetized with 40 mgkg ' pentobarbital (intraperitoneally) and trans- 
cardially perfused with ~30 ml of nearly frozen (~0 °C) modified artificial cere- 
brospinal fluid (aCSE) at a rate of ~20 ml min” *. The modified aCSF for perfusion 
contained (in mM): 225 sucrose, 119 NaCl, 2.5 KCl, 1.0 NaH,PO,, 4.9 MgCh, 
0.1 CaCh, 26.2 NaHCOs, 1.25 glucose. After perfusion, the brain was quickly 
removed and placed into ice-cold aCSF for 1-2 min. Coronal sections containing 
the medial PFC (250 um) were prepared with VT-1200 vibratome (Leica). Slices 
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were placed in a holding chamber (containing aCSF with 1 mM ascorbic acid 
added 15min before brain dissection), and allowed to recover for at least 
30 min before being placed in the recording chamber and superfused with a 
bicarbonate-buffered solution saturated with 95% O, and 5% CO, and containing 
(in mM): 119 NaCl, 2.5 KCI, 1.0 NaH2POu,, 1.3 MgCh, 2.4 CaCl, 26.2 NaHCOs, 
and 11 glucose, at 32-34 °C. 

Electrophysiology. Picrotoxin (10014M) and CNQX (10M) were present 
throughout the experiment to block inhibitory and excitatory synaptic transmis- 
sion, respectively. Cells were visualized using infrared differential interference 
contrast video microscopy. Whole-cell current-clamp recordings were made using 
a MultiClamp 700B amplifier (Molecular Devices). Electrodes (2.8-4.0 MQ) con- 
tained (in mM): 120 potassium methanesulphonate (KMeSO,), 20 HEPES, 0.4 
EGTA, 2.8 NaCl, 5 TEA-Cl, 2.5 MgATP, and 0.25 NaGTP, biocytin (1%), pH 7.2- 
7.3 (270-285 mOsm). Series resistance (10-40 MQ) was continually monitored 
on-line with a —20 pA, 300 ms current injection given after every current injection 
step; if the series resistance changed by more than 20%, data were not included in 
the analysis. Membrane potentials were not corrected for junction potentials 
(estimated to be 10mV). After breaking into a neuron, the resting membrane 
potential was set to —80 mV by injecting DC current through patch amplifier. 
To measure the amount of current required to reach action potential threshold, a 
series of current steps (2 ms duration at 2.5 Hz, 0 to 2500 pA range with +10 pA 
step increments) were injected into the cell until an action potential was generated. 
To determine the input resistance, hyperpolarizing current injections (1s dura- 
tion, 0 to —100 pA in —25 pA step increments) were injected into the cell. Input 
resistance was taken at the linear part of the trace. Sustained firing was determined 
from a series of 11 current injections (1s duration, 50 pA steps). All values were 
obtained after the cells had reached a stable response and were averages of three 
cycles for each cell. 

Nucleated patch recordings were performed as described before to dissect 
K* current components’’. Briefly, recording pipettes were filled with K* 
based internal solution (in mM): 140 KMeSO,, 5 KCl, 0.05 EGTA, 2 MgCh, 
2 Na2ATP, 0.4 NaGTP and 10 HEPES (pH adjusted to 7.3). Recordings were made 
at room temperature (23-24 °C). Immediately after establishing whole-cell con- 
figuration from identified pyramidal cells, a negative pressure (— 150 to — 190 mbar) 
was applied through patch pipette to bring the nucleus close to the pipette tip. The 
patch pipette was then slowly withdrawn until the membrane patch surrounding the 
nucleus was completely pulled out of the cell. After excision, nucleated patches were 
held with a small constant negative pressure (—30 to —60 mbar) and were voltage 
clamped at a holding potential of —90 mV. P/—4 protocol in pClamp 10 was used for 
on-line subtraction of leakage and capacitive currents. A custom-made multi- 
barrel system consisting three glass capillaries was used to apply drugs to isolate 
different voltage-gated K* currents. Tetrodotoxin (TTX, 500nM) and cadmium 
(Cd?*, 100 1M) were included in HEPES-buffered ACSF (in mM: 138 NaCl, 
10 HEPES, 2.5 KCl, 2 CaCl, 1 MgCl and 25 glucose, pH 7.3) in each barrel to 
block voltage-gated Na* and Ca** currents, respectively. The K* current blockers 
4-aminopyridine (4-AP, 0.5mM) and tetraethylammonium (TEA, 20 mM) were 
included in different barrels, and nucleated patches were inserted into the barrels 
sequentially. Macroscopic voltage-gated K* current was elicited by a 150.ms test 
pulse stepping from —90 mV to +70 mV and up to 10 sweeps of recordings were 


collected and averaged to represent the current in each drug treatment. Drug- 
sensitive K* currents were then obtained by off-line subtraction and peak currents 
were measured to calculate the respective contribution in macroscopic K* current. 
To study background K* conductance, voltage-clamp recordings were made in the 
presence of 500nM TTX, 0.1mM Cd?*, 10mM TEA and 1mM CsCl to block 
voltage-gated Na‘, Ca*", K* currents and I,/inward rectifier currents, respectively. 
Cells were voltage-clamped at —70 mV. Under this condition, a slow voltage ramp 
from —130mV to —70 mV was applied across 1s before and after 1 mM barium 
(Ba’*) application. Pharmacological subtraction revealed Ba’*-sensitive current 
and background K* conductance was then calculated at —70 mV according to 
ref. 27. 

All recorded neurons were filled with biocytin and processed for post-hoc 
immunohistochemical staining to confirm the presence of apical dendrites. 
Only neurons exhibiting a clear apical dendrite, indicative of pyramidal neurons, 
were included in the analysis. All data were acquired at 20 kHz and filtered at 
10 kHz using Clampex 10.2 software (Molecular Devices). 

Data analysis. Statistical significance was primarily assessed using ANOVA, fol- 
lowed by post-hoc test when applicable. Data were analysed with Prism 
(GraphPad Software). 

Virus expression and histology. Following the completion of behavioural experi- 
ments, rats were deeply anaesthetized with euthasol and perfused transcardially 
with PBS followed by 4% paraformaldehyde dissolved in PBS. Brains were 
removed and fixed in 4% paraformaldehyde for an additional 24-48 h. Fifty- 
micrometre-sections of the PFC were made on a vibratome. Some slices were 
stained for 1h with 2% Neurotrace fluorescent Nissl stain (Invitrogen; excitation 
530 nm, emission 615 nm) diluted in PBS. Slices were then washed and mounted 
on gelatin-coated slides, treated with fluorescent-mounting media and mounted. 
Some slices were not stained with Neurotrace and were directly washed with PBS 
and mounted. In these slices, 4’,6-diamidino-2-phenylindole (DAPI) (0.15 pg 
ml’) was included in the fluorescent-mounting media. Expression of ChR2- 
eYFP and eNpHR3.0-eYFP was examined for all rats using a Nikon inverted 
fluorescent microscope with a X4 objective or an Olympus Fluoview FV 1000 with 
a X4 objective. Rats showing no bilateral eYFP expression in the PFC owing to 
faulty microinjections, and showing bilateral fibre placement outside the PFC were 
excluded from analysis. 

Reconstruction of optical stimulation sites in the prelimbic cortex. Optical 
stimulation sites of in vivo optogenetic experiments were determined as described 
previously”°. Briefly, fixed and stained coronal brain sections containing the mPFC 
were examined with Nikon inverted fluorescent microscope or an Olympus 
Fluoview FV1000. Optic fibre tracks were located in the slice and optical stimu- 
lation sites were determined as 0.5 mm below the fibre tip (Supplementary Figs 8, 9 
and 11). 


28. Chen, B. T. et a/. Cocaine but not natural reward self-administration nor passive 
cocaine infusion produces persistent LTP in the VTA. Neuron 59, 288-297 (2008). 

29. Knackstedt, L. A. & Kalivas, P. W. Extended access to cocaine self-administration 
enhances drug-primed reinstatement but not behavioral sensitization. 
J. Pharmacol. Exp. Ther. 322, 1103-1109 (2007). 

30. Stuber, G. D. et a/. Excitatory transmission from the amygdala to nucleus 
accumbens facilitates reward seeking. Nature 475, 377-380 (2011). 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature12037 


Visualization of an endogenous retinoic acid gradient 
across embryonic development 


Satoshi Shimozono’, Tadahiro Iimura', Tetsuya Kitaguchi’, Shin-ichi Higashijima* & Atsushi Miyawaki'* 


In vertebrate development, the body plan is determined by prim- 
ordial morphogen gradients that suffuse the embryo. Retinoic acid 
(RA) is an important morphogen involved in patterning the 
anterior—posterior axis of structures, including the hindbrain’ 
and paraxial mesoderm”*. RA diffuses over long distances, and 
its activity is spatially restricted by synthesizing and degrading 
enzymes’. However, gradients of endogenous morphogens in live 
embryos have not been directly observed; indeed, their existence, 
distribution and requirement for correct patterning remain con- 
troversial’®. Here we report a family of genetically encoded indica- 
tors for RA that we have termed GEPRAs (genetically encoded 
probes for RA). Using the principle of fluorescence resonance 
energy transfer we engineered the ligand-binding domains of RA 
receptors to incorporate cyan-emitting and yellow-emitting fluor- 
escent proteins as fluorescence resonance energy transfer donor 
and acceptor, respectively, for the reliable detection of ambient 
free RA. We created three GEPRAs with different affinities for 
RA, enabling the quantitative measurement of physiological RA 
concentrations. Live imaging of zebrafish embryos at the gastrula 
and somitogenesis stages revealed a linear concentration gradient 
of endogenous RA in a two-tailed source-sink arrangement across 
the embryo. Modelling of the observed linear RA gradient suggests 
that the rate of RA diffusion exceeds the spatiotemporal dynamics 
of embryogenesis, resulting in stability to perturbation. Further- 
more, we used GEPRAs in combination with genetic and pharma- 
cological perturbations to resolve competing hypotheses on the 
structure of the RA gradient during hindbrain formation and somi- 
togenesis. Live imaging of endogenous concentration gradients 
across embryonic development will allow the precise assignment of 
molecular mechanisms to developmental dynamics and will acce- 
lerate the application of approaches based on morphogen gradients 
to tissue engineering and regenerative medicine. 

RA is a small lipophilic molecule that acts as a ligand for nuclear RA 
receptors (RARs). RA is synthesized from retinal by retinaldehyde 
dehydrogenase type 2 (Raldh2) and degraded into polar metabolites 
by Cyp26 (Supplementary Fig. 1a). During early vertebrate embryo- 
genesis, regions of RA synthesis and degradation were mapped by 
detecting messenger RNA encoding these metabolizing enzymes. In 
zebrafish, raldh2 is expressed in the mid-trunk, whereas cyp26 is 
expressed at both the anterior and posterior ends’’. We therefore 
speculate that a two-tailed gradient of RA could form with the highest 
concentration in the mid-trunk and tapering off at each end (Sup- 
plementary Fig. 2). Gradient formation of other morphogens, all of 
which are genetically encoded peptides, could be observed by fusing 
them to green fluorescent protein (GFP) and introducing them exo- 
genously into embryos’*"*. However, the non-peptidic structure of RA 
makes it difficult to image directly’’. It therefore remains unknown 
whether postulated RA gradients exist, have a linear distribution and 
are required for normal embryonic development. 

To address this gap we developed GEPRAs. The ligand-binding 
domains (LBDs) from mouse RARs were flanked by cyan fluorescent 


protein (CFP) and yellow fluorescent protein (YFP), mutants of 
Aequorea GFP (Fig. 1a). Alterations in the conformation of the LBD 
in response to RA binding are converted into changes in fluorescence 
resonance energy transfer (FRET) from CFP to YFP. Among the con- 
structs containing the LBD from RAR-f (Supplementary Fig. 3a), the 
GEPRA-B fusion protein showed the largest response in HeLa cells 
(Supplementary Fig. 4). In situ calibration for the intracellular RA 
concentration ([RA],) demonstrated an apparent dissociation constant 
(K'g) of 2nM (Fig. 1b). Responses to the RA precursors retinal and 


b 0.5 

@ 0.5 4 2 
: poa 8 
w fn 
9 044 EY 
2 bo.3 2 
© 2 

0.01 0.1 1 10 100 


Concentration (nM) 


Ratio (CFP/FRET) 


C Transmission 
0.5 


d GEPRA-B_0.5__—sraldh2/cyp26s ro) 


Bud Li : 
u| L>3 
0.44 r2 
L4 
= 5 
5 — 


e GEPRA-G_ 0.45 


Ratio 
(WY) 'vu] 


0.44 [6 
x 
2 4 f4 = 
a = 
fod = 
0.34 l2 Ss 

ij 5 
) 500 1,000 1,500 


Head-to-tail distance (tum) 


Figure 1 | Development and characterization of GEPRAs. a, Schematic 
representation of GEPRA. The light and dark yellow denote the unbound and 
RA-bound states of GEPRA, respectively. b, RA titration curves for GEPRA-B 
(red) and GEPRA-G (blue). Rinax (RA-depleted form) and Rin (RA-saturated 
form) values for GEPRA-B were slightly higher than those of GEPRA-G. Fitting 
with the Michaelis-Menten equation yielded K’g values of 2 and 4nM for 
GEPRA-B and GEPRA-G, respectively. Each data point is the mean = s.d. for 
nine experiments. c, Transmission and fluorescence (YFP and FRET ratio) 
images of a five-somite embryo from the GEPRA-B transgenic zebrafish line. 
d, e, Ratiometric [RA]; images (left), whole-mount in situ hybridization results 
for raldh2 and cyp26s (centre), and graphs of the spatial distributions of raldh2 
and cyp26s (right) in bud-stage embryos expressing GEPRA-B (d) and GEPRA- 
G (e). Absolute [RA]; values are displayed on the right side of each graph. 
Scale bars, 200 tum. 
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retinol were almost negligible (K’g >> 100 nM; Supplementary Fig. 1b). 
We speculated that using multiple GEPRAs with different RA affinities 
would allow us to measure [RA]; quantitatively and create two addi- 
tional GEPRA variants. GEPRA-AA was generated by introducing two 
amino-acid substitutions in the LBD of GEPRA-B. Of the constructs 
containing the LBD from RAR-y, GEPRA-G produced the best results 
(Supplementary Fig. 3b). GEPRA-AA and GEPRA-G produced K’g 
values of 50 nM (Supplementary Fig. 5a) and 4 nM (Fig. 1b), respectively. 

We generated transgenic zebrafish lines ubiquitously expressing 
GEPRA-B or GEPRA-G. In a five-somite embryo expressing GEPRA- 
B, the probe was uniformly distributed (Fig. 1c, YFP), but the CFP/FRET 
ratio—that is, [RA];—was highest in the mid-trunk and lowest in 
the head and tail (Fig. 1c, ratio). Next we depleted embryos of RA 
with 10 uM 4-(diethylamino)benzaldehyde (DEAB), an inhibitor of 
Raldh2. DEAB abolished the GEPRA-B-derived high [RA]; signal in 
the mid-trunk (Supplementary Fig. 6). This indicates that the probe 
is RA-specific because DEAB should increase retinal and retinol 
concentrations. 

We compared signals representing [RA]; with the spatial expression 
patterns of RA-metabolizing enzymes. Immediately after imaging with 
GEPRA-B (Fig. 1d, left), bud-stage embryos were fixed and subjected 
to in situ hybridization with probes specific for raldh2 and cyp26 
species (cyp26s; a mixture of cyp26al, cyp26b1 and cyp26c1)° (Fig. 1d, 
centre). Comparative spatial profiles indicated that the signals repre- 
senting [RA], were high in the raldh2-expressing region and low in 
the cyp26s-expressing region (Fig. 1d, right). Two intermediate zones 
flanked by the raldh2-expressing and cyp26s-expressing regions were 
of particular interest. In the posterior zone, the signal representing 
[RA]; was graded almost linearly. In the anterior zone, however, the 
probe saturated near the raldh2-expressing region. To examine high 
levels of [RA]; quantitatively, we imaged GEPRA-G transgenic zebra- 
fish at the bud stage (Fig. le) and found linear [RA]; gradients in 
both the anterior and posterior intermediate zones. We also expressed 
GEPRA-AA transiently by injecting fertilized wild-type eggs with 
mRNA. GEPRA-AA produced no [RA]; gradient in three-somite 
embryos (Supplementary Fig. 5b). On the basis of the K’g values of 
the three GEPRAs, the highest [RA]; situated anteriorly within the 
raldh2-expressing region was estimated to be 6nM (Fig. le). All of 
the gradients identified by using both GEPRA-B and GEPRA-G 
(Fig. 1d, e) reached the edges of the cyp26s-expressing regions, indi- 
cating that the steady-state distribution of RA is linear. This finding 
supports the model in which a local source and a local sink together 
generate a linear gradient in the flanked region based on a simple 
diffusion model’* within the spatiotemporal parameters of embryo- 
genesis’. We examined whether a rectangular distribution of [RA]; 
could exist stably in the intermediate zones with a computer simu- 
lation. Assuming that the diffusion coefficient of RA and the width of 
the zone were 10 uum? s ! (ref. 20) and 200 tm, respectively, our simu- 
lation showed that a rectangular distribution created a linear gradient 
in about 10 min (Supplementary Fig. 7), which is much faster than the 
time scale of embryonic growth and supports a source-sink model in 
the formation of a linear gradient by simple diffusion. 

RA signalling in zebrafish embryos was previously observed using 
an exogenous reporter gene system, in which an RA response element 
(RARE) drives the expression of enhanced YFP (eYFP)’'. However, 
RARE-eYFP transgenic zebrafish did not produce specific fluores- 
cence signals until very late, at roughly the 18-somite stage (Sup- 
plementary Fig. 8a), and even at the 20-somite stage eYFP fluorescence 
was detected only in old somites. In contrast, the GEPRA-B signal 
indicating a high [RA]; was distributed from the region near the otic 
vesicle to the most posterior somite (Supplementary Fig. 8b). At this 
stage, raldh2 mRNA accumulated in the eyes and all somites, whereas 
mRNAs encoding cyp26s were abundant in the head region and tail tip 
(Supplementary Fig. 8c); these patterns agree with the distribution 
of GEPRA-B signals representing [RA];. Thus, RARE-eYFP, which 
unlike GEPRAs is an indicator of RA signalling and not [RA];, is not 
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sensitive in zebrafish embryos because the eYFP chromophore takes a 
relatively long time to mature compared with the rapid timescale of 
zebrafish embryogenesis. 

We verified that the [RA]; gradient in the head region at the bud 
stage (Fig. ld,e) was contained within the hindbrain field (Sup- 
plementary Fig. 9). We also found that the signal representing high 
[RA]; in the mid-trunk was detectable at 75% epiboly and developed 
into a clear peak at the tailbud stage (Fig. 2a). Given these observations, 
we were able to resolve a historical controversy about the putative RA 
gradient in the hindbrain*®. The major argument against the pre- 
sence of this gradient is the observation that embryos depleted of 
endogenous RA can be fully rescued with a uniform concentration 
of exogenous RA®*. To examine whether this approach resulted in a 
rectangular RA distribution, we performed rescue experiments with 
the GEPRA-B transgenic line (Fig. 2b). At 4h after fertilization (hpf), 
embryos were exposed to 10 1M DEAB and various concentrations of 
RA. The embryos were assessed for [RA]; at the three-somite to four- 
somite stages and for morphology at 36hpf. Treatment with 10 uM 
DEAB (Fig. 2b, second column of images) nearly abolished the signal 
representing high [RA]; and the imaged embryo developed a kinked 
head, which is a characteristic of RA depletion in zebrafish”. When 
10nM RA was applied together with 10 1M DEAB (Fig. 2b, fourth 
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Figure 2 | [RA]; gradient during hindbrain development. a, Time-lapse 
[RA]; imaging of GEPRA-B in an embryo from 70% epiboly to bud stage. A 
series of transmission images (top), [RA]; images (middle) and ratio profiles 
(bottom) are shown. The most anterior and most posterior points represented 
in the ratio profiles are indicated in the transmission images by cyan and green 
dots, respectively. Times since the start of imaging are shown above the figures. 
b, Visualization of [RA]; in GEPRA-B-bearing embryos (three-somite (3S) to 
four-somite (4S) stage) with and without DEAB to deplete RA, and various 
concentrations of exogenous RA as denoted above the images. In each column 
reflecting the various treatment conditions, transmission and [RA]; images are 
accompanied by a graph showing the spatial distribution of [RA]; along the 
anterior—posterior axis of the body. A transmission image of a later 
developmental stage (36 hpf) is also presented. A red arrowhead indicates a 
kink in the head region. Scale bars, 200 um. 
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column), however, a signal reflecting high [RA]; was detected. 
Although full recovery was not observed, a substantial [RA]; gradient 
was created in the hindbrain field. The gradient was sufficient to allow 
normal embryo development. Under the same conditions, 1nM RA 
(Fig. 2b, third column) resulted in a smaller [RA]; gradient and did not 
rescue the zebrafish from the effects of RA depletion. The results were 
reproduced for each treatment condition with multiple embryos (Sup- 
plementary Fig. 10). These perturbation experiments revealed that the 
distribution of RA for hindbrain patterning is reliably governed by 
local degradation of RA, as proposed previously*°. Such [RA]; gradi- 
ents depending on RA degradation were also observed during somi- 
togenesis (Supplementary Fig. 11). In the zebrafish mutant giraffe (gir) 
containing a mutation in the cyp26a1 gene, there are patterning defects 
in various organs”. We injected GEPRA-B mRNA into gir embryos, 
and did indeed find a global increase in [RA]; in six-somite embryos 
(Supplementary Fig. 12). 

Another essential morphogen that functions during the formation 
of the embryonic axis includes the fibroblast growth factor (FGF) 
family’***™*. The interactions between RA and FGF signalling include 
positive and negative feedback and feedforward mechanisms. To 
examine how Fef8 regulates endogenous RA gradients, we performed 
[RA], imaging experiments using embryos in which Fef8 signalling 
was suppressed. We injected fgf8 splice-blocking morpholino oligo- 
nucleotides (MOs)** into one-cell-stage embryos to phenocopy the 
well-characterized zebrafish mutant acerebellar (ace)*®*’. As a result 
of mutation of the fgf8 gene, homozygous ace embryos lack both a 
cerebellum and organizer in the midbrain—-hindbrain boundary while 
retaining expression of some rhombomere-marker genes”. Because 
RA signalling is required for the formation of posterior segments of 
the hindbrain”, such as rhombomeres 5-7, we assessed whether the 
RA gradient was retained in the hindbrain field of fgf8 morphants 
expressing GEPRA-B. Time-lapse transmission images confirmed 
effective knockdown of fgf8 expression based on a bulge in the area of 
the developing midbrain (arrowheads)* and shortened tail (Fig. 3a, b). 
In the MO-treated embryos, the signal representing high [RA]; in the 
mid-trunk was observed until 12.5 hpf (Fig. 3b). Normal posteriori- 
zation of their hindbrain*® was verified by using in situ hybridization 
with two rhombomere markers (Supplementary Fig. 13). In addition, 
the expression profiles of raldh2 and cyp26s were unaffected at 10 hpf 
(Fig. 3c). After 13.5 hpf, however, high [RA]; was attenuated to baseline 
levels in fgf8 morphants (Fig. 3b). We found that this decrease in [RA]; 
was due to decreased levels of raldh2 expression (Fig. 3d), indicating 
that fgf8 may have a role in the maintenance of raldh2 gene expression. 
The dovnnepulalion of raldh2 expression was previously observed 
in ace mutants”*®; however, the GEPRA system allowed us to directly 
image the effect on the RA gradient after the loss of raldh2 expression. 

The free diffusion of a lipophilic molecule such as RA may be 
limited in aqueous environments. A large fraction of RA molecules 
are probably transported intracellularly by interaction with cellular 
RA-binding proteins*”. On the assumption that most intracellular 
RAs are bound to carrier proteins, GEPRAs detect the free form of RA. 
The endogenous RA-buffering system is reminiscent of intracellular 

* buffering in which loading cells with a large amount of high- 
affinity Ca** probes does not markedly affect intracellular Ca** 
dynamics”’. Thus, the expression of GEPRAs can be increased without 
disturbing endogenous RA dynamics or the normal embryonic deve- 
lopment of fish. In support of this, GEPRA transgenic embryos developed 
without apparent morphological aberrations. 

Here we have generated fluorescent probes that allowed direct visu- 
alization of the endogenous RA gradient in live embryos. The existence 
of an RA gradient is a matter of historical debate*®. Our GEPRA 
system clearly demonstrated that a two-tailed linear RA gradient exists 
during early embryogenesis, and that Fgf8 is not required for RA 
gradient formation in the hindbrain but is required for the mainten- 
ance of raldh2 expression and the RA gradient later in development. 
The ability of GEPRA to directly reveal endogenous morphogen 
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Figure 3 | [RA]; gradient is affected by fgf8 expression. a, b, Time-lapse 
[RA]; imaging of GEPRA-B-bearing embryos from 10.5 to 16.5 hpf after 
injections of control MO (a) or fgf8-specific MO (b). Transmission images (top 
row), [RA]; images (middle row) and ratio profiles (bottom row) are presented. 
The control embryo showed normal somitogenesis; somite stages are labelled in 
the transmission images (a). A bulge in the area of the developing midbrain in 
the embryo injected with the fgf8-specific MO is indicated by black arrowheads; 
a signal representing a high [RA]; in the trunk is indicated by white arrows 
(b). c, d, Comparative in situ hybridizations showing expression of raldh2 and 
cyp26s in embryos at 10 hpf (c) and 14hpf (d) after injections with control or 
fgf8-specific MO. All images are lateral views. Scale bars, 200 um. 


gradients such as RA in live embryos will allow a greater under- 
standing of their roles and mechanisms in patterning the vertebrate 
embryo and will support gradient-based approaches in medicine and 
bioengineering. 


METHODS SUMMARY 


The genes for GEPRAs were constructed from complementary DNAs encoding 
the LBD of mouse RAR-B or RAR-y, and subcloned into pCS2 for expression in 
HeLa cells and for in vitro synthesis of mRNA or into pT2KXIGAin for generation 
of transgenic zebrafish lines. [RA]; imaging was performed with an inverted con- 
focal microscope equipped with a X10 objective lens and a 440-nm laser. With the 
aperture fully open, non-confocal fluorescence images were acquired. The dual- 
emission ratio imaging was performed under exactly the same conditions for in 
situ [RA]; calibration (HeLa cells) and in vivo time-lapse two-dimensional [RA]; 
imaging (zebrafish embryos). Thus, absolute [RA]; values in the two-dimensional 
image can be obtained from the ratios by using the curves shown in Fig. 1b. RA, 
retinol, retinal and DEAB were administrated with 0.1% dimethylsulphoxide 
(DMSO). The morphology of an embryo was observed with the confocal micro- 
scope simultaneously with [RA]; or with a stereomicroscope afterwards. The 
transgenic zebrafish line (Tg(RARE-gata2:NTD-eYFP)Id1)*' was obtained from 
the Zebrafish International Resource Center (University of Oregon). The eYFP 
fluorescence was observed with an inverted confocal microscope equipped with a 
473-nm or 488-nm laser. cDNA encoding raldh2, cyp26al, cyp26b1, cyp26cl, 
krox20, hoxd4, otx2 or myoD in pCS2 was used to construct the probe for in situ 
hybridization. All embryos were allowed to grow after live [RA]; imaging, unless 
they were subjected to in situ hybridization. Embryos developed without any 
morphological aberrations, indicating that GEPRA expression and light exposure 
are not morphogenic. 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Materials. RA, retinol, retinal and DEAB were purchased from Sigma-Aldrich. 
These compounds were dissolved in dimethylsulphoxide (DMSO) to prepare 
stock solutions. The compounds were administered in a final concentration of 
0.1% DMSO. 

Gene construction. cDNA encoding the LBD of RAR-§ (FANTOM3 (ref. 31), 
6820403N24) was amplified using primers containing 5’ SphI and 3’ Sacl sites. 
The digested product was ligated to a BamHI/SphI fragment encoding CFP 
and a SacI/Xhol fragment encoding YFP. The ligated product was subcloned into 
the BamHI/Xhol sites of the pCS2 vector for mammalian expression. In this 
plasmid, genes encoding the donor and/or acceptor were replaced to improve 
the RA indicator. Similarly, cDNA encoding RAR-y (FANTOM3 (ref. 31), 
F730319B19) was used to make a Sphl/Sacl fragment encoding the LBD. 
Together with the BamHI/SphI fragment encoding CFP and a SacI/EcoRI frag- 
ment encoding YFP, cDNA coding for the indicator was subcloned into the 
BamHI/EcoRI sites of the pCS2 vector. Introduction of two amino-acid substitu- 
tions (R269A and S280A)* in the LBD of RAR-B was performed as described 
previously’. 

Characterization of the indicators. HeLa cells were grown in DMEM medium 
supplemented with 10% FBS. cDNAs were transfected into cells by using 
Lipofectamine and Plus reagent (Invitrogen). Two days after transfection, the 
medium was exchanged with DMEM/F12 without phenol red. Dose-response 
curves were determined with nine transfected cells. 

Generation of GEPRA transgenic zebrafish lines. GEPRA cDNAs were sub- 
cloned into the pT2KXIGAin vector, and transgenic lines were created as 
described previously**. Fish were maintained at 28.0 °C. 

In vivo RA imaging. A glass bead (Iuchi BZ-1) was placed on a coverslip, and 
1% agarose (Takara L03) in E3 medium (5 mM NaCl, 0.17 mM KCl, 0.4mM CaCh, 
0.16 mM MgSO,) was poured on the coverslip and allowed to harden. The glass 
bead was then removed to generate a round chamber. An embryo that had been 
anaesthetized with Tricaine at more than 16 hpf was placed in the chamber and 
covered with 0.3% agarose in E3 medium. The chamber was submerged in E3 
medium containing Tricaine. Time-lapse two-dimensional imaging was performed 
in the xy-t mode with an FV1000 (Olympus) confocal inverted microscope system 
equipped with a X10 objective lens (numerical aperture 0.4; UPlanApo) and a 440- 
nm laser. With the aperture fully open, non-confocal fluorescence images were 
acquired. The ratio imaging of embryos was performed under exactly the same 
conditions as for calibration experiments with HeLa cells. Thus, absolute [RA]; 
values in the image can be obtained from the ratios using the curves shown in 
Fig. 1b. Image processing was performed with ImageJ software (http://rsb.info.nih. 
gov/ij/). After the background had been subtracted, ratio images were generated and 
median-filtered. 

Estimation of GEPRA concentration in embryos. Confocal fluorescence images 
were acquired with a GEPRA transgenic embryo and a series of YFP solutions of 
various concentrations with excitation at 488 nm. The size of the confocal aperture 
was about 2 Airy disks. By comparing the intensities of the confocal fluorescence 
images, the GEPRA concentration in the embryo was estimated to be approxi- 
mately 0.1 1M. Considering the concentration® and affinity for RA (refs 36, 37) of 


LETTER 


cellular RA-binding proteins, which were reported to be 10 1M and 0.13-2 nM, 
respectively, only a small fraction of RA molecules should be bound to GEPRA. 

Transgenic RARE-eYFP zebrafish. Transgenic zebrafish (RARE-gata2:NTD- 
eYFP)Id1)*! were obtained from the Zebrafish International Resource Center 
(University of Oregon). Fluorescence was observed with an FV1000 confocal 
microscope (Olympus) with a 488-nm laser or an FV10i confocal microscope 
(Olympus) with a 473-nm laser. 

Depletion of endogenous RA and application of exogenous RA. Embryos were 
incubated in E3 medium containing 0.1% DMSO (vehicle), 10 1M DEAB, 1nM 
RA with 10 1M DEAB, or 10nM RA with 10 uM DEAB from 4 to 24hpf. After 
24 hpf, embryos were bathed in E3 medium. RA imaging was performed at 11 hpf 
(roughly the three-somite stage). The imaged embryos were kept at 28.0 °C until 
their morphologies were examined at 36 hpf with a stereomicroscope (MZ16 F; 
Leica) equipped with a charge-coupled device (CCD) camera (DP50; Olympus). 
In situhybridization. cDNA encoding raldh2, cyp26a1, cyp26b1, cyp26c1, krox20, 
hoxd4, otx2 or myoD was amplified from a cDNA library prepared from 10-hpf or 
24-hpf embryos and cloned into the pCS2 vector. Probes were labelled with a 
digoxigenin or fluorescein labelling mix (Roche Diagnostics) and detected with 
alkaline phosphatase-conjugated antibodies (1:5,000; Roche Diagnostics). The 
colorimetric reaction was performed with BM purple reagent (Roche Dia- 
gnostics). Images were captured with a stereomicroscope (MZI16F; Leica) 
equipped with a CCD camera (DP50; Olympus). Fluorescence and in situ hybridi- 
zation images were linearly registered with the Image Processing Toolbox in 
MATLAB software (MathWorks). 

Knock down of fgf8. Splice-site-targeted morpholino oligonucleotides for the fgf8 
gene (E212 (5'-TAGGATGCTCTTACCATGAACGTCG-3’) and E313 (5’-CAC 
ATACCTTGCCAATCAGTTTCCC-3’)) were used to block its pre-mRNA splic- 
ing in zebrafish embryos”. One-cell-stage embryos were injected with both E212 
and E3I3 (2.5ng of each) or a control oligonucleotide (5ng; 5'-CCTCTTA 
CCTCAGTTACAATTTATA-3’). 
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A pathogenic picornavirus acquires an envelope by 
hijacking cellular membranes 


Zongdi Feng’, Lucinda Hensley’, Kevin L. McKnight’, Fengyu Hu’, Victoria Madden’, LiFang Ping’, Sook-Hyang Jeong’, 


Christopher Walker*, Robert E. Lanford? & Stanley M. Lemon’®’ 


Animal viruses are broadly categorized structurally by the presence 
or absence of an envelope composed of a lipid-bilayer membrane’, 
attributes that profoundly affect stability, transmission and immune 
recognition. Among those lacking an envelope, the Picornaviridae 
are a large and diverse family of positive-strand RNA viruses that 
includes hepatitis A virus (HAV), an ancient human pathogen that 
remains a common cause of enterically transmitted hepatitis” *. 
HAV infects in a stealth-like manner and replicates efficiently in 
the liver®. Virus-specific antibodies appear only after 3-4 weeks of 
infection, and typically herald its resolution**. Although unex- 
plained mechanistically, both anti-HAV antibody and inactivated 
whole-virus vaccines prevent disease when administered as late as 
2weeks after exposure’, when virus replication is well established 
in the liver®. Here we show that HAV released from cells is cloaked 
in host-derived membranes, thereby protecting the virion from 
antibody-mediated neutralization. These enveloped viruses (‘eHAV’) 
resemble exosomes’, small vesicles that are increasingly recognized 
to be important in intercellular communications. They are fully infec- 
tious, sensitive to extraction with chloroform, and circulate in the 
blood of infected humans. Their biogenesis is dependent on host 
proteins associated with endosomal-sorting complexes required for 
transport (ESCRT)*, namely VPS4B and ALIX. Whereas the hijacking 
of membranes by HAV facilitates escape from neutralizing antibodies 
and probably promotes virus spread within the liver, anti-capsid 
antibodies restrict replication after infection with eHAV, suggesting 
a possible explanation for prophylaxis after exposure. Membrane 
hijacking by HAV blurs the classic distinction between ‘enveloped’ 
and ‘non-enveloped’ viruses and has broad implications for mecha- 
nisms of viral egress from infected cells as well as host immune 
responses. 

Supernatant fluids of hepatoma cell cultures infected with low- 
passage, non-cytopathic HAV? (Fig. 1A) contain two populations of 
virus particles that are resolved in isopycnic iodixanol gradients 
(Fig. 1B). One bands at a low density consistent with membrane asso- 
ciation (1.06-1.10 gcm °, fractions 8-12) and is not detected in a 
capsid antigen enzyme-linked immunosorbent assay (ELISA) 
(Fig. 1C, left), whereas the other bands at the density expected for 
picornaviruses (1.22-1.28gcm °, fractions 18-22) and is readily 
detected by ELISA. Electron microscopy of the light fractions revealed 
numerous virus-like particles enclosed in membranes (see a-d in 
Fig. 1D, and Supplementary Fig. 1a) with morphology indistinguish- 
able from ~27-nm HAV particles in dense fractions (see e in Fig. 1D). 
These membrane-bound structures ranged from 50 to 110 nm in dia- 
meter, a size similar to that of exosomes’, and contained one to four 
virus-like particles (Supplementary Fig. 1b). Consistent with this, viral 
RNA banded in gradients with capsid protein (VP2) and the exosome- 
associated protein flotillin-1 (Supplementary Fig. 1c). 


A modified plaque assay (infrared fluorescence immunofocus assay; 
IR-FIFA)’° revealed the membrane-wrapped particles to be infectious 
(Fig. 1E) with a specific infectivity equivalent to that of virions 
(Fig. 1F). Extraction with chloroform, a classic method for distinguish- 
ing enveloped from non-enveloped viruses, had no effect on standard 
virions but resulted in a 2log; decrease in infectious virus in the light 
fraction (Fig. 1E and Supplementary Fig. 2a). We call these membrane- 
wrapped HAV particles ‘enveloped HAV’ (eHAV). Capsid antigen 
could be detected in the eHAV fraction after treatment with a nonionic 
detergent (1% Igepal CA-630; NP-40) (Fig. 1C, right). This shifted the 
viral particles to an intermediate density in iodixanol gradients (1.15- 
1.17gcm_*) but did not destroy infectivity (Supplementary Fig. 2b, c). 
A potent, neutralizing, monoclonal antibody (mAb), K24F2 (ref. 11), 
failed to neutralize eHAV (Fig. 1G), providing further evidence for 
complete envelopment of the capsid. 

In 12 experiments, eHAV represented 79 + 13% (mean + s.d.) of 
virus in medium from infected cell cultures. HAV was released from 
multiple cell types and was also observed with high-passage cytopathic 
virus (Supplementary Fig. 3a—-e). Gradient-purified eHAV contained 
mostly mature VP2 (Fig. 1H, lane 3, and Supplementary Fig. 2d), 
indicating that enveloped virions had undergone maturation cleavage 
of VP0 to VP4 + VP2. However, whereas non-enveloped virions con- 
tained fully processed VP1, eHAV contained primarily unprocessed 
VP1pX (Fig. 1H; compare lanes 3 and 4). pX is an unusual 8-kDa 
carboxy-terminal extension on VP1 that is unique to HAV among 
picornaviruses (Fig. 1A). It functions in virion assembly and is pro- 
cessed from VP1 by an unidentified host protease late in the viral 
lifecycle’*"*. pX was protected from proteinase K in eHAV particles, 
but was rendered susceptible to digestion after treatment with NP-40 
(Supplementary Fig. 2e). Thus, pX is fully enclosed in membranes. We 
conclude that most HAV is released enveloped in host membranes, a 
process we term ‘membrane hijacking’. 

Infectious virus circulating in the blood of infected humans and 
chimpanzees (Pan troglodytes) possesses the buoyant density of 
eHAV (Fig. 2a and Supplementary Fig. 4a-e), confirming the rele- 
vance of these observations to HAV pathogenesis. However, virus in 
faeces is non-enveloped (Fig. 2b and Supplementary Fig. 4c). Because 
most faecal virus is produced in the liver'*’*, the membrane may be 
stripped from eHAV during passage through the biliary tract to the 
gut. However, suspending eHAV in bile minimally altered its buoyant 
density (Supplementary Fig. 5a). 

It has been suggested’* that poliovirus, a distantly related picorna- 
virus, could be released from cells wrapped in a single membrane if 
entrapped in an autophagosome that fuses with the plasma membrane. 
However, RNA-mediated interference knockdown of beclin-1, which 
mediates autophagosome formation, did not inhibit eHAV release 
(Supplementary Fig. 6a, b). In contrast, knockdown of VPS4B or 
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ALIX, which are ESCRT-II-associated proteins that facilitate the bud- 
ding of many enveloped viruses'”°, inhibited the release of eEHAV 
from cells (Fig. 3a-c and Supplementary Figs 6c and 7a) but not viral 
RNA replication (Fig. 3d), intracellular assembly of capsids or encap- 
sidation of viral RNA (Supplementary Fig. 7a—c). ESCRT complexes 
act sequentially to sort and load cargo into multivesicular bodies 
(MVBs)*. VPS4B is an ATPase that provides energy for the dissolution 
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8-12 microscopic images of negatively stained eHAV 
(a-d from fraction 10 in B) and a non-enveloped 
virion (e, fraction 20 in B). E, Infectious titre of 
pooled fractions containing eHAV or non- 
enveloped virions before and after extraction with 
chloroform. FFU, focus-forming units. F, Specific 
infectivity of pooled fractions containing eHAV or 
non-enveloped virions, calculated by dividing the 
HAV RNA copy number (GE, qRT-PCR) by the 
infectious titre (FFU, IR-FIFA). Values shown are 
means and range for duplicate RT-PCR reactions. 
G, eHAV is resistant to neutralization by anti- 
capsid monoclonal antibody K24F2 (ref. 11). 
Antibody-virus mixtures were incubated for 1 h at 
37 °C and inoculated onto cells for 1 h, followed by 
removal of the inoculum, washing three times with 
PBS, and the addition of an agarose overlay. 

Viral antigen was detected by IR-FIFA"”. 

H, Immunoblots of HAV capsid proteins (VP1 and 
VP2) in lysates of mock-infected or HAV-infected 
cells (lanes 1 and 2), gradient-purified eHAV (lane 
3) and chloroform-extracted non-enveloped 
virions (lane 4). 
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of ESCRT-III complexes at membrane fission, whereas ALIX 
contributes to exosome biogenesis and also facilitates lentivirus 
budding*”’**. Knockdown of VPS4B and ALIX inhibited the release 
of both enveloped and non-enveloped HAV (Fig. 3c), suggesting these 
share a common egress pathway and that non-enveloped virus in 
extracellular fluids might be derived from eHAV. Knockdown of 
HRS and TSG101, which are ESCRT-0 and ESCRT-I proteins involved 
early in cargo recruitment, did not inhibit eHAV release (Fig. 3b, c). 
Thus, eHAV release is dependent on ESCRT-associated proteins but 
not on the entire ESCRT machinery, in a similar manner to the 
budding of some enveloped viruses*’*. Although an interaction of 
ALIX with the ESCRT-III protein CHMP4B facilitates the bud- 
ding of human immunodeficiency virus”, CHMP4A, CHMP4B or 
CHMP4C knockdown minimally decreased eHAV release (Sup- 
plementary Fig. 8). 

Structural proteins of enveloped viruses interact with ESCRT-related 
proteins by means of Pro-rich late (“L’) domain motifs: PPXY, P(S/T)AP 
or (L)YPXj/3L (refs 8, 17). Only two domains in the 2,227-residue-long 
HAV polyprotein conform to these motifs: conserved, tandem YPX3L 
motifs separated by 28 residues in the VP2 capsid protein (Fig. 3e). 
YPX;,/3L motifs mediate interactions with ALIX (ref. 19), and their pres- 
ence in VP2 is consistent with a requirement for ALIX. Disrupting either 


Figure 2 | eHAV circulates in the blood of HAV-infected humans and 
chimpanzees. a, Distribution of HAV RNA in an iodixanol gradient loaded 
with early, acute-phase serum from patient BH12. b, Buoyant density of HAV 
particles from plasma (red open circles) and faeces (black open circles) of an 
experimentally infected chimpanzee, x0293 (ref. 5). Faecal HAV RNA is shown 
10°; plasma RNA is shown X10? (also see Supplementary Fig. 4). The low 
buoyant density of circulating virus was not due to a passive effect of serum, 
because the buoyant density of non-enveloped virions was not altered by 
suspension in 90% serum (data not shown). 
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Figure 3 | eHAV biogenesis requires ESCRT-associated proteins. 

a, Immunoblots of HRS, TSG101, VPS4B and ALIX in HAV-infected cells 72h 
after transfection with the indicated gene-specific targeting (T) or non- 
targeting (NT) control siRNAs. Asterisks, non-specific bands. RT-PCR assays 
confirmed knockdown of TSG101 and VPS4B (Supplementary Fig. 6c). b, Viral 
RNA detected in culture fluids 48-72 h after siRNA transfection. c, Relative 
yield and buoyant density of virus released from siRNA transfected cells (as in 
a) determined in iodixanol gradients. RNA associated with eHAV particles 
(fractions 8-12) was decreased by 85% and 87% by depletion of VPS4B and 
ALIX, respectively. d, One-step growth of eHAV. Huh-7.5 cells were 
transfected with the indicated siRNAs for 3 days, then infected with gradient- 
purified eHAV at a multiplicity of infection of 20. Cell-associated HAV RNA 
was assayed by qRT-PCR. Knockdown efficiency was confirmed at the end of 
the experiment (not shown). e, Tandem YPX3L ALIX-interacting motifs (red) 


YPX3L motif by substituting Ala for Tyr (Fig. 3e) severely decreased 
virus release but did not impair viral RNA replication (Fig. 3f, g). 
Intracellular capsid antigen was similarly decreased, suggesting that 
the mutations impede capsid assembly, but the antigen was detected 
in cells transfected with Y-A(II) RNA in which the second YPX3L motif 
had been ablated (Fig. 3h). Limited capsid assembly was confirmed by 
K24F2 precipitation of RNase-protected viral RNA from detergent- 
treated cell lysates (Fig. 3i). Consistent with a role for ALIX in eHAV 
biogenesis, anti-ALIX antibody precipitated encapsidated (RNase- 
protected) viral RNA from lysates of cells transfected with wild-type, 
but not Y-A(II), RNA (Fig. 3i). Furthermore, ALIX banded at the same 
density as eHAV in iodixanol gradients (Supplementary Fig. 7d). 
Although details remain to be worked out, these data suggest that 
HAV co-opts ALIX to facilitate its envelopment in host membranes. 
Where this occurs is uncertain, but HAV particles have been observed 
within cytoplasmic vesicles in infected liver'*”. 

The cloaking of its capsid by membranes allows eHAV to evade 
neutralizing antibodies (Fig. 1G and Supplementary Fig. 9a) and 
explains how infectious virus co-exists with anti-HAV in serum”. 
However, antibodies protect against hepatitis A when passively trans- 
ferred after HAV replication has been well established in the liver?®. 
We therefore reasoned that eHAV might be neutralized after binding 
to or entering hepatocytes. Consistent with this, eHAV was neutralized 


in VP2. Below are the motif sequences in wild-type (WT, HM175/18f) and 
mutant viral RNAs. f, Viral RNA released into culture fluids 24-48 h after 
electroporation of Huh-7.5 cells with wild-type and mutant viral RNAs. A3D is 
a replication-incompetent subgenomic replicon RNA with a frameshift 
mutation in 3D?"", g, Intracellular HAV RNA after electroporation of cells with 
WT and VP2 mutant RNAs. h, Confocal microscopy showing K24F2 detection 
of capsid antigen in cells electroporated 48 h previously with WT or mutant 
RNAs. K24F2 recognizes a conformation-dependent assembled neutralization 
epitope in the viral capsid'’. i, Anti-capsid (K24F2) and anti-ALIX (Bethyl) 
antibody-mediated immunoprecipitation of encapsidated, RNase-resistant 
viral RNA in detergent-treated lysates of cells electroporated with wild-type or 
mutant HAV RNAs. RT-PCR data are means = s.e.m. for two or three replicate 
assays; all results are representative of two or three independent experiments. 


when cells were exposed to antibody either immediately before inocu- 
lation with virus or up to 6h afterwards (Fig. 4a, b). IgG and IgA anti- 
capsid mAbs neutralized eHAV when added to cultures after removal 
of the inoculum, but an IgM mAb, H14C42 (ref. 11), failed to do so 
despite potent neutralizing activity against non-enveloped HAV (Sup- 
plementary Fig. 9b). Taken together, these results show capsid is the 
target for neutralization and that eHAV membranes must rupture 
before neutralization, but not before eHAV is internalized and inac- 
cessible to H14C42 in the medium. This suggests that eHAV may be 
neutralized after endocytosis. Chloroquine efficiently blocked infec- 
tion by eHAV (but not by non-enveloped HAV) (Fig. 4c), indicating 
that endosomal acidification is important for eHAV entry, whereas 
an antibody against the HAV receptor, TIM-1 (HAVCR-1)*, inhi- 
bited both (Fig. 4d). We found eHAV membranes to be stable at 
pH5.0 (Supplementary Fig. 5b), but they could nonetheless be 
degraded in late endosomes and lysosomes”. If so, this would render 
the capsid accessible to TIM-1, which undergoes constitutive endocy- 
tosis and trafficking to late endosomes and lysosomes”, as well as to 
anti-HAV. Many details of eHAV entry and neutralization remain to 
be explained. However, TRIM-21 (tripartite motif-containing 21), 
which is implicated in intracellular neutralization of other viruses”’, 
seems to have no function in neutralization of eHAV (Supplementary 
Fig. 9c). 
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Our findings show howa ‘non-enveloped’ picornavirus promotes its 
egress in the absence of cell lysis, and reveal a previously unrecognized 
strategy by which a virus cloaks itself in host membranes to evade 
neutralizing antibodies. The host membrane enveloping eHAV is 
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Figure 4 | Extracellular eHAV is resistant to antibody-mediated 
neutralization but is neutralized by antibody after infection. 

a, Neutralization of eHAV in cells treated with anti- HAV before (pre) or after 
(post) adsorption of virus. Left, intracellular; right, extracellular. Pretreated cells 
were incubated with JC plasma (1:50-1:100) for 1h, washed extensively with 
PBS, then incubated with gradient-purified eHAV (multiplicity of infection 
about 6) for 1 h in the absence of antibody. For post-treatment, JC was added to 
the medium on refeeding after removal of the inoculum. Cells were 
subsequently re-fed at 24h intervals with fresh medium containing no 
antibody. Data are means + s.e.m. from duplicate cultures and are 
representative of six independent experiments. b, Adding anti-HAV to 
medium as late as 6h after removal of the inoculum restricts the replication of 
eHAV (left) but not that of non-enveloped HAV (right). JC antibody was added 
at intervals after removal of the inoculum, and intracellular viral RNA was 
quantified at 48 h. Results are means + s.e.m. of replicate RT-PCR assays and 
are representative of three independent experiments. c, Chloroquine (50 uM) 
blocks eHAV entry but not HAV entry. Cells were treated for 30 min before 
inoculation with virus, and were harvested at 48 h to assay intracellular HAV 
RNA. d, Anti-TIM-1 mAb 190-4 blocks entry of both eHAV and HAV. GL37 
cells were incubated with 190-4 (10-100 Lg ml ~ ') or control IgG (100 pig ml ) 
for 1 h at 37 °C, then inoculated with virus for 1 h. Intracellular HAV RNA was 
assayed at 72h. RT-PCR data are means + s.e.m. for two or three replicate 
assays; all results are representative of two or three independent experiments. 
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likely to facilitate its spread within the liver. Antibodies restrict the 
replication of eHAV when added to cultures several hours after infec- 
tion, but further studies will be needed to determine how this happens 
and whether this can explain why immune globulin and vaccines 
protect against hepatitis when administered long after exposure, when 
virus is already circulating in blood’. Our results suggest that categori- 
zing viruses into those that are enveloped and those that are not, longa 
tradition in virology’, is overly simplistic, and that efforts should be 
made to identify other examples of membrane hijacking by ‘non- 
enveloped’ viruses. 


METHODS SUMMARY 


Non-cytopathic, cell-culture-adapted HM175/p16 HAV? was propagated in Huh- 
7.5 cells. Reverse molecular genetics studies were performed with pHM175/18f, a 
molecular clone of a related cytopathic variant**. Buoyant density was assessed in 
8-40% iodixanol (Opti-Prep) gradients centrifuged at 141,000g for 48h at 4°C. 
Viral RNA was measured by quantitative PCR with reverse transcription (qRT- 
PCR) with primers targeting the 5’ untranslated region. Infectivity was quantified 
by IR-FIFA'. For RNA-mediated interference studies, cells were transfected with 
SmartPool short interfering RNAs (siRNAs) (Dharmacon) and samples were 
collected 48-72 h later for viral RNA quantification. To analyse VP2-ALIX inter- 
actions, cell lysates were prepared 48 h after electroporation of mutant and wild- 
type viral RNAs, treated with RNase and immunoprecipitated. RNA extracted 
from immunoprecipitates was assayed by HAV-specific qRT-PCR. For intracel- 
lular neutralization, cells were incubated with eHAV for 1 h at 37 °C, then washed 
extensively. Antibodies were added at intervals, and intracellular and extracellular 
HAV RNA was quantified at 48-72 h. For standard neutralization assays, virus was 
incubated with antibodies for 1 h at 37 °C, then inoculated onto cells for IR-FIFA"’. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Reagents and antibodies. Chemical reagents were purchased from Sigma unless 
otherwise noted. A human convalescent plasma (JC plasma) sample with high- 
titre anti- HAV antibody, collected 90 days after the onset of acute hepatitis A, was 
described previously’'”’. Other antibodies were obtained as follows: anti-TSG101 
(Abcam), anti-HRS (Bethyl), anti- ALIX (Pierce and Bethyl), anti- VPS4B (Sigma), 
anti-beclin-1 (Cell Signaling), anti-flotillin-1 (BD Biosciences) and anti-HAV 
mAbs K24F2 and K32F2 (Commonwealth Serum Laboratories, Victoria, 
Australia). The sources of other anti- HAV mAbs have been described previously”’. 
Anti-TIM-1 mAb 190-4 was a gift from G. Kaplan. Guinea-pig anti-peptide 
antibodies against HAV VP1 (residues 211-228) and VP2 (residues 240-259) 
were a gift from D. Sangar. 

Cells and viruses. Huh-7.5 and HepG2 (both human hepatoma cells), MRC-5 
(human lung fibroblasts), FRhK-4 (fetal rhesus monkey kidney cells), BS-C-1 and 
GL37 (ref. 30) (both African green monkey kidney) cells were cultured in DMEM 
medium supplemented with 10% FBS. Cell-culture adapted variants of the HM175 
strain of HAV, HM175/p16 (refs 9, 31) (non-cytopathic) and HM175/18f (refs 28, 
32) (rapidly replicating, cytopathic), have been described previously””’. Infectious 
virus titres were determined by a modification of the IR-FIFA described previously’® 
performed in FRhK-4 cells. 

Human samples and clinical information. All human samples and related clini- 
cal information were obtained with the approval of the Institutional Review 
Boards of the Seoul National University Bundang Hospital and the University 
of North Carolina at Chapel Hill. 

Chimpanzee samples. All chimpanzee materials were archived samples collected 
during previous studies’ conducted prior to 15 December 2011. 

HAV capsid antigen ELISA assay. HAV capsid antigen was detected with an 
ELISA modification of a previously described radioimmunoassay”. Immobilized 
antigen captured by JC antibody was detected by sequential incubation of ELISA 
plates with K24F2 (1:1,000 dilution) and horseradish peroxidase-conjugated goat- 
anti-mouse antibody (1:10,000) at room temperature (20-23 °C) for 1h each. 
After washing with PBS-T and substrate (3,3’,5,5'-tetramethylbenzidine) addi- 
tion, D459 was determined with a Synergy 2 (BioTek) microplate reader. 
qRT-PCR assay for HAV RNA. Total cellular RNA was extracted from cell 
lysates with the RNeasy Kit (Qiagen); RNA from culture supernatants and gra- 
dient fractions was extracted with the QiaAmp viral RNA Isolation Kit (Qiagen). 
HAV RNA was quantified by real-time qRT-PCR against a synthetic RNA stand- 
ard as described previously’. Intracellular viral RNA was normalized to total RNA 
abundance or glyceraldehyde-3-phosphate dehydrogenase mRNA level deter- 
mined in a SYBR green one-step qRT-PCR assay (Bio-Rad). For experiments 
involving electroporation of synthetic viral RNAs, a two-step (RT-PCR procedure 
was conducted in which complementary DNA was synthesized with (dT) 9 as 
primer, followed by RNaseH digestion and qPCR assay with primers targeting the 
HAV 5’ untranslated RNA segment. 

Virus neutralization. To assess eHAV neutralization following binding or entry 
of eHAV into cells, Huh-7 cells (FT3-7 subline) were incubated with gradient- 
purified eHAV for 1h at 37°C, then washed extensively and refed with fresh 
medium. Antibodies were added at intervals, and intracellular and extracellular 
HAV RNA was quantified 48-72 h after virus inoculation. For standard neutral- 
ization assays, virus was incubated with antibodies for 1 h at 37 °C, then inoculated 
onto FRhK-4 cells which were processed subsequently for IR-FIFA”°. 
Immunoblotting. Cells were lysed with CelLytic Lysis Buffer (Invitrogen) in the 
presence of protease inhibitor cocktail (Roche). Proteins were extracted from 
gradient fractions by precipitation with trichloroacetic acid. Immunoblots were 
performed with standard procedures and the indicated antibodies. Protein bands 
were detected with an Odyssey Infrared Imaging System (LI-COR Biosciences). 
Immunofluorescence microscopy. Huh-7.5 cells seeded on four-well chamber 
slides were fixed with 4% paraformaldehyde, permeabilized with 0.2% Triton 
X-100 in PBS, and blocked with 10% goat serum. Cells were then incubated with 
mAb K24F2 (1:300 dilution in 1% BSA) for 1 h at room temperature, washed, and 
incubated with fluorescein isothiocyanate-conjugated goat anti-mouse antibody 
(1:300; Southern Biotech) for 1h. Images were collected with a Zeiss 510 Meta 
laser-scanning confocal microscope. 

Isopycnic gradient centrifugation of virus. Cell-culture supernatant fluids were 
centrifuged at 1,000g for 10 min at 4 °C to remove cells and debris, further clarified 
by centrifugation twice at 10,000g for 30 min, and concentrated by ultracentrifu- 
gation at 100,000g for 1h at 4°C. The resulting pellet was resuspended in PBS, 


loaded onto a 8-40% iodixanol (Opti-Prep) step gradient, and centrifuged at 
141,000g in a Superspin 630 rotor for 48h at 4°C in a Sorvall Ultra-80 ultracen- 
trifuge. Approximately 20 fractions were collected from the top of the gradient; 
density was determined with a refractometer. 

Rate-zonal centrifugation of HAV in sucrose gradients. Lysates were prepared 
from HAV (HM175/p16)-infected Huh-7.5 cells, loaded onto a 7.5-45% (w/v) 
sucrose gradient in Hank’s balanced buffer, and centrifuged at 4 °C in a Beckman 
SW55i rotor for 3h at 42,000 r.p.m. (167,000g). Fractions were assayed by HAV 
antigen ELISA and HAV RNA-specific qRT-PCR to identify pentamers, procap- 
sids and mature HAV virions. 

Immunoprecipitation of HAV. Cells (2 < 10°) were lysed in the presence of 
detergent with CelLytic lysis buffer (Invitrogen), treated with 50 pgml~' RNase 
A (Qiagen) and 25 U ml ' RNase T1 (Ambion) at 37 °C for 15 min, and incubated 
overnight with antigen-specific and isotype-matched antibodies at 4 °C. Protein G 
beads were added and incubated for a further 2h. Beads were extensively washed 
with lysis buffer, and captured immunoprecipitates were subjected to RNA extrac- 
tion for qRT-PCR, or immunoblotting, as above. 

Virus isolation from iodixanol gradient fractions. Huh-7.5 cells in 12-well 
culture plates were inoculated with 10-100 ul of peak iodixanol gradient fractions 
containing eHAV recovered from human or chimpanzee serum or plasma. Cells 
were re-fed weekly, and split 1:3 at 2 weeks. RNA was extracted from supernatant 
culture fluids 8, 23 and 30 days after inoculation and assayed for HAV RNA by 
qRT-PCR as described above. To confirm viral isolates, RNA was extracted from 
supernatant culture fluids with the QIAamp Viral RNA Mini Kit (Qiagen), and 
cDNA was transcribed with random primers (Superscript III First-Strand 
Synthesis System; Invitrogen). HAV cDNA was amplified by nested RT-PCR with 
primer sets bracketing the VP1pX-2B junction (nucleotides 2908-3306 in wild- 
type HM-175) (PrimeSTAR; TaKaRa). Second-round PCR products were isolated 
from agarose gels (Mini-Elute Gel Extraction Kit; Qiagen) and directly sequenced. 
Generation and characterization of HAV VP2 mutants. Mutagenesis of the 
infectious cDNA clone, pT7-HM175/18f, containing the complete sequence of 
HM175/18f virus”, was accomplished with the Quick Change II Site-directed 
Mutagenesis Kit (Agilent). PCR-derived fragments were sequenced to ensure 
sequence fidelity. Plasmids were linearized and viral RNA was synthesized with 
the T7 Megascript Kit (Ambion). Synthetic viral RNA (101g) was mixed with 
5 X 10° Huh-7.5 cells in a 4-mm cuvette and pulsed once at 250 V, 950 uF and 
100Q in a Gene Pulser Xcell Total electroporation system (Bio-Rad). Cells and 
supernatants were harvested at subsequent intervals for RNA extraction and qRT- 
PCR assay. 

Electron microscopy. Gradient fractions (2.5 il each) were spread onto the sur- 
face of a glow-discharged 400 mesh Formvar/carbon-coated copper grid and 
allowed to adsorb for 5 min. The grid was inverted onto a droplet of 1% glutar- 
aldehyde in 0.15 M sodium phosphate pH 7.4 for 1 min, followed by rinsing with 
deionized water and staining with 3% ammonium molybdate pH 7.0. The grid was 
air-dried before examination in a LEO EM910 transmission electron microscope 
(Carl Zeiss SMT, Inc.) operating at 80 kV. Digital images were acquired with a 
Gatan Orius SC1000 CCD Digital Camera and Digital Micrograph 3.11.0 software 
(Gatan, Inc.). 

siRNA knockdown of protein expression. Huh-7.5 cells were transfected with 
gene-specific SmartPool siRNAs (Dharmacon; see Supplementary Table 1) with 
SilentFect (Bio-Rad) transfection reagent. Knockdown efficiency was determined 
at 48-72 h by qRT-PCR (SYBR Green One-step qRT-PCR Kit; BioRad) measure- 
ment of mRNAs, or by immunoblotting as described. 
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Landscape of the PARKIN-dependent ubiquitylome 
in response to mitochondrial depolarization 
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The PARKIN ubiquitin ligase (also known as PARK2) and its 
regulatory kinase PINK1 (also known as PARK6), often mutated 
in familial early-onset Parkinson’s disease, have central roles in 
mitochondrial homeostasis and mitophagy’*. Whereas PARKIN 
is recruited to the mitochondrial outer membrane (MOM) upon 
depolarization via PINK1 action and can ubiquitylate porin, 
mitofusin and Miro proteins on the MOM'*"’, the full repertoire 
of PARKIN substrates—the PARKIN-dependent ubiquitylome— 
remains poorly defined. Here we use quantitative diGly capture 
proteomics (diGly)'*’* to elucidate the ubiquitylation site specifi- 
city and topology of PARKIN-dependent target modification in 
response to mitochondrial depolarization. Hundreds of dynami- 
cally regulated ubiquitylation sites in dozens of proteins were iden- 
tified, with strong enrichment for MOM proteins, indicating that 
PARKIN dramatically alters the ubiquitylation status of the mito- 
chondrial proteome. Using complementary interaction proteo- 
mics, we found depolarization-dependent PARKIN association 
with numerous MOM targets, autophagy receptors, and the pro- 
teasome. Mutation of the PARKIN active site residue C431, which 
has been found mutated in Parkinson’s disease patients, largely 
disrupts these associations. Structural and topological analysis 
revealed extensive conservation of PARKIN-dependent ubiquity- 
lation sites on cytoplasmic domains in vertebrate and Drosophila 
melanogaster MOM proteins. These studies provide a resource for 
understanding how the PINK1-PARKIN pathway re-sculpts the 
proteome to support mitochondrial homeostasis. 

PARKIN, a RING-HECT hybrid E3 ubiquitin ligase’*, is recruited 
to the MOM in cancer cell lines, mouse embryo fibroblasts (MEFs) and 
primary neurons to promote mitophagy in response to mitochondrial 
depolarization'*’*"°. This process requires PINK1, which binds the 
MOM translocase (TOMM) complex and phosphorylates PARKIN on 
multiple residues to promote MOM localization and ligase activation’””°. 
The best understood PARKIN substrates are MFN1/2 and RHOT1/2 
(MIRO in Drosophila), MOM-tethered GTPases whose PARKIN- 
dependent proteasomal turnover alters fission-fusion cycles and 
microtubule-dependent trafficking of mitochondria, respectively’? *'°". 
The MOM porin proteins VDAC1/2/3 are also ubiquitylated by 
PARKIN, and are required for PARKIN localization on mitochondria 
through a poorly understood mechanism’*’. Proteomic studies of 
purified mitochondria have also identified additional proteins whose 
abundance is either decreased or increased upon PARKIN activation 
or mitochondrial depolarization’, but precisely how these proteins are 
regulated and the extent to which ubiquitin is involved is unclear. 
Consequently, we do not have a comprehensive understanding of cel- 
lular PARKIN targets, which is critical for elucidating how PARKIN 
promotes mitochondrial homeostasis. Moreover, for the vast majority 
of E3 ubiquitin ligases, including PARKIN, the extent to which ubiqui- 
tin transfer is site specific and signal dependent is largely unknown, 
and a global understanding of site specificity across a wide range of 
substrates is lacking for any E3. Such information is necessary for 


decoding the cellular and molecular topology of E3 function and for 
defining how the ubiquitin system re-sculpts the proteome. 

We set out to systematically identify cellular PARKIN-dependent 
ubiquitylation targets and the dynamics of modification in a site- 
specific manner using quantitative diGly (QdiGly) proteomics. QdiGly 
merges antibody-based capture of ‘diGly remnant’-containing pep- 
tides (marking ubiquitylated proteins after trypsinolysis) and stable 
isotopic labelling with amino acids in culture (SILAC) to identify 
ubiquitylation sites that are dynamically induced, in this case, upon 
mitochondrial depolarization'*’’. To overcome inherent stochastic 
sampling for low-abundance diGly peptides (see Supplementary 
Fig. 1a, b)'*’’, we performed 73 control and QdiGly profiling experi- 
ments in four cell lines with and without elevated PARKIN levels 
(~tenfold) (Supplementary Fig. 2a) in a three-tiered manner, varying 
the length of depolarization with carbonyl cyanide m-chlorophenyl 
hydrazone (CCCP) (Fig. la and Supplementary Table 1). In some 
cases, protein turnover was blocked by proteasome inhibition with 
bortezomib (Btz) or autophagy inhibition with bafilomycin A (BafA) 
(Supplementary Table 1). Elevated PARKIN levels were used. to 
increase the likelihood of capturing signal-dependent and mechanis- 
tically relevant ubiquitylation events that would otherwise be below 
the level of detection due to low stoichiometry or abundance. 

Using HCT116 cells expressing a haemagglutinin-derived epitope 
tag (HA)-fused with PARKIN (HCT1 16PARKIN Tier 1), we performed 
34 control and QdiGly experiments (Supplementary Table 1), quan- 
tifying 4,772 non-redundant ubiquitylation sites in 1,654 proteins 
(Fig. la, b). In 18 of these experiments using Btz or CCCP/Btz for 
1h, we identified 443 diGly sites (261 proteins) with logo(heavy:light 
(H:L)) ratios=1.0 (twofold change) in at least one experiment 
(Fig. 1b, c, Supplementary Fig. 2a—c, Supplementary Tables 1, 2), with 
a Pearson’s correlation of 0.69 for two representative experiments 
(Supplementary Fig. 2e). Sixteen additional Tier 1 experiments with 
depolarization times up to 10h identified 537 non-redundant diGly 
sites (304 proteins), including 192 diGly sites (138 proteins) also iden- 
tified at 1h (Supplementary Fig. 2d). Comparison of log,(H:L) ratios 
for 48 Tier 1 sites in 36 proteins at 1 and 8 h post CCCP treatment from 
parallel experiments revealed persistent or increased ubiquitylation for 
34 sites when proteasome activity is blocked (Supplementary Fig. 2f). 

To systematically determine PARKIN dependence and compare 
targets in distinct cell lines, we performed 6 QdiGly experiments in 
HeLa versus HeLa? “®*N cells (Tier 2) with CCCP/Btz (1 h) and reverse 
SILAC labelling (Fig. 1a, Supplementary Tables 1, 2). As expected*’, 
CCCP-dependent MEN2 poly-ubiquitylation in HeLa cells required 
PARKIN expression (Supplementary Fig. 2a). We identified 582 
PARKIN-dependent diGly peptides (303 proteins) (Fig. 1b), with a 
Pearson’s correlation of 0.88 for duplicate samples (Supplementary 
Fig. 2e) and significant overlap across biological triplicates (Fig. 1d). 
Importantly, 165 diGly sites (99 proteins) were common to Tiers 1 
and 2 with 1h of depolarization (Fig. le). This increased to 191 
sites (144 proteins) when all Tierl and Tier 2 data were compared 
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a Tier 1 b Proteins Allsites 1h CCCP/Btz 
Sites ID Proteins ID Sites quant quant (proteins) sites (proteins) 
HCT116PARKIN Cell line (FDR,%)  (FDR,%) (FDR,%) (FDR,%) log,(H:L)>1_ log,(H:L)>1 
Tier 3 Tier 2 HCT11 PARKIN 6,934 (0.61) 1,993 (0.85) 4,772 (0.23) 1,654 (0.36) 807 (438) 443 (261) 
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Figure 1 | QdiGly proteomics for PARKIN-dependent ubiquitylation. biological triplicates (1h CCCP + Btz) (Supplementary Tables 1, 2). Ex, 


a, Proteomics work flow. b, diGly sites identified and quantified across 73 

experiments. FDR, false discovery rate. c, Log.(H:L) plots for quantified diGly 
peptides for HCT116°“"“ (experiment 17) or HeLa’“®*'N (experiment 57) 
cells (Supplementary Table 2). d, Overlap of ubiquitylation sites in HeLa’“*SN 


(Supplementary Table 2). We refer to the overlapping set of ubiquity- 
lation sites with 1 h of depolarization and their associated proteins as 
Class 1 candidate PARKIN-dependent targets, whereas proteins found 
in both cell lines but with different sites of ubiquitylation are referred 
to as Class 2 (Supplementary Table 2). Whereas a twofold increase in 
HiL ratio was used as a threshold for regulated ubiquitylation, many 
diGly sites were induced 30-60-fold upon depolarization (Fig. 1f, 
Supplementary Table 2), indicating highly dynamic target modifica- 
tion via PARKIN. 

Class 1 targets were enriched in mitochondrial proteins (P< 
1.76 X 10 '7), although numerous cytoplasmic proteins were also 
identified (Fig. 2, Supplementary Fig. 2i). In total, 60 Class 1 and 2 
targets were linked with mitochondria or endoplasmic reticulum-type 
membranes, including 36 MOM proteins. Candidate cytoplasmic tar- 
gets included proteasome subunits, the VCP/p97 ATPase, and pro- 
posed autophagy adaptors (SQSTM1, CALCOCO2 (also known as 
NDP52), and TAXIBP1). 

We next performed 22 control and QdiGly profiling experiments 
using HCT116 or neuronal SH-SY5Y cells (Tier 3, Fig. la, b, Sup- 
plementary Tables 1, 2), which show primarily mono-ubiquitylated 
forms of MFN2 ubiquitylation upon depolarization, apparently due to 
low levels of endogenous PARKIN (Supplementary Fig. 2a). Cumula- 
tively, 838 diGly sites (391 proteins) in HCT116 and 337 diGly sites 
(235 proteins) in SH-SY5Y had log,(H:L) = 1.0 (Fig. 1b), with exten- 
sive overlap across Tier 1 and 2 data sets (Supplementary Fig. 2g) 
and biological triplicates in SH-SY5Y (1 h/CCCP) (Supplementary 
Fig. 2h). Fifteen sites in 12 Class 1 or 2 targets were found in both cell 
lines, and 29 and 27 CCCP-dependent diGly sites in 27 and 17 Class 1 
or 2 proteins were detected in SH-SY5Y or HCT116 cells, respectively 
(Fig. 2, Supplementary Tables 1, 2). Additionally, 124 Tier 3 sites were 
identified in 29 Class 1 or 2 proteins that were distinct from Class 1 
sites. Thus, many PARKIN-dependent diGly targets were confirmed at 
endogenous PARKIN levels. Ubiquitylation of several candidate sub- 
strates was demonstrated by immunoblotting, and ubiquitylation was 
reduced upon small-interfering RNA (siRNA)-mediated depletion of 
PARKIN for several targets tested (Supplementary Fig. 3a—d). 


experiment. e, Ubiquitylation site and protein overlap between all 
HCT116°4®*N and HeLa’“*“™ experiments treated with CCCP and 

Btz for 1h. f, log,(H:L) ratios for selected diGly sites from HCT1 167425 
(Ex17) and HeLa®“®*N (Ex57) (1h CCCP + Btz). 


To complement QdiGly proteomics, we used anti-HA affinity 
purification-mass spectrometry (AP-MS)” to identify depolarization- 
dependent high-confidence candidate HA~PARKIN-interacting proteins 
(HCIPs) in duplicate experiments in 293T’“*"' cells (Fig. 3a, Sup- 
plementary Fig. 4a, Supplementary Table 3), with further validation in 
HeLa’“8"N cells (Supplementary Fig. 4b, Supplementary Table 4) 
(see Methods). Because PARKIN action is dynamic, we examined 
several time points (1, 4 or 8h) after depolarization in the presence 
or absence of Btz or BafA. To allow a semiquantitative assessment of 
interactions, we compared bait-normalized average assembled pep- 
tide spectral matches (APSMs) for proteins that were HCIPs under 
at least one condition examined unless otherwise noted (Fig. 3a, 
Supplementary Fig. 4a, Supplementary Tables 3, 4). Without depola- 
rization, no HCIPs were common to both wild-type PARKIN bio- 
logical duplicates, consistent with PARKIN auto-inhibition before 
activation” (Fig. 3a, Supplementary Text). Remarkably, upon depola- 
rization, four classes of HCIPs were identified in at least one condition 
examined (Fig. 3a, b): (1) 20 MOM proteins, including MFN1/2, 
RHOT1/2 and VDACs, HK1/2, the fission protein FIS1 and its inter- 
acting protein TBC1D15, and the translocase components TOMM20 
and TOMM70; (2) the autophagy adaptors SQSTM1 (also known as 
p62), CALCOCO2 and TAX1BP1; (3) subunits of the catalytic and 
regulatory particles of the proteasome; and (4) the VCP (also known 
as p97) ATPase implicated in MFN1 turnover’. Nineteen of the 26 
mitochondrial and autophagy proteins and 39 of 40 proteasome sub- 
units found in 293T cells were also identified in at least one condition 
examined in HeLa cells (Fig. 3b, Supplementary Fig. 4b). Depolarization- 
dependent interaction of Flag-PARKIN with selected endogenous 
HCIPs was verified by anti-Flag immunoprecipitation and immuno- 
blotting (Supplementary Fig. 5). Notably, 21 of the 27 mitochondrial 
and autophagy receptor proteins found associated with PARKIN, as 
well as eight subunits of the regulatory particle of the proteasome and 
VCP, were also identified as Class 1 or 2 PARKIN-dependent ubiqui- 
tylation targets (Figs 2, 3b, Supplementary Table 2). 

PARKIN activation is thought to reverse auto-inhibition by its amino- 
terminal ubiquitin-like domain (UBL), which has been proposed to 
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Figure 2 | PARKIN-dependent ubiquitylation sites revealed by QdiGly 
proteomics. Class 1 sites are in black font. Additional sites found in Class 1 
proteins are in red (HCT1 16485) or blue font (HeLa ***), Site overlap in 
HCT116 and/or SH-SY5Y: magenta, orange, and green octagons. Dotted lines: 
interacting proteins. Rectangles represent Class 2 substrates. Red or blue boxes 
refer to additional sites identified in either HCT116°4®“ and HeLa?“®8'N 
cells (Supplementary Table 2). MIM, mitochondrial inner membrane. * and 
*, protein levels decrease or increase, respectively, upon depolarization’. 


WTPARKIN §& 
“Tt 448 


148 
148148148 


Figure 3 | PARKIN associates with mitochondrial proteins and the 
proteasome in response to depolarization. a, Heat map of HCIPs 
(represented by APSMs) for HA—Flag—-PARKIN and mutants in 
response to depolarization in 293T cells, with or without Btz or BafA. 
Proteins indicated had weighted and normalized D-scores = 1.0, 
Z-score = 5, and APSMs = 2 unless otherwise noted (see Methods). 
b, Summary of PARKIN- interacting proteins in 293T and HeLa and 
integration with diGly sites. 
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sterically hinder ubiquitin-conjugating (E2) enzyme interaction with 
the catalytic centre*’”’. We explored the role of these regions in MOM 
recruitment, MFN2 ubiquitylation, substrate binding and proteasome 
association by using five mutants (Supplementary Fig. 6a): (1) AUBL, 
lacking the UBL domain (residues 1-80), (2) K27/48R, which blocks 
ubiquitylation of two regulated diGly sites in the UBL (Fig. 2), (3) S65A 
and S65E, which block PINK1-dependent phosphorylation of the UBL 
domain implicated in PARKIN activation in vitro'’, and (4) C4315, 
removing the active site Cys in RING2. The Parkinson’s disease patient 
mutation PARKIN(C431F) fails to localize to mitochondria in res- 
ponse to depolarization’*”®. Consistent with this, PARKIN(C431S) 
failed to localize to mitochondria, efficiently poly-ubiquitylate MFN2, 
or interact robustly with MOM substrates in response to depolariza- 
tion (Fig. 3a, Supplementary Fig. 6a—d). In contrast, K27/K48R, S65A 
and S65E mutants showed near wild-type activity for localization, 
MEN2 ubiquitylation and interaction with MOM proteins at 1h post 
depolarization (Fig. 3a, Supplementary Fig. 6a-d, Supplementary 
Tables 3, 6), although we cannot rule out a kinetic effect of these muta- 
tions. PARKIN(AUBL) displayed readily detectable association with 
MOM substrates in the absence of depolarization, consistent with some 
loss of auto-inhibition”, but nevertheless showed enhanced substrate 
association upon depalitization, indicating residual PINK1 depend- 
ence despite the absence of the UBL and S65. However, MFN2 ubiqui- 
tylation was reduced compared with wild-type PARKIN, suggesting 
reduced ubiquitylation activity. PARKIN(AUBL) also lacked a robust 
increase in proteasomal binding (Fig. 3a). 

Our data indicate that PARKIN interacts with and promotes the 
site-specific ubiquitylation of numerous mitochondrial and cyto- 
plasmic proteins, thereby extending previous studies examining abun- 
dance of mitochondrially enriched proteins'*. To place candidate 
PARKIN-dependent ubiquitination targets into a structural and func- 
tional framework, we have created an interactive resource that integrates 
dynamic ubiquitylation data with available structural information 
(Supplementary Fig. 7b-e and Methods). We explored the spatial rela- 
tionship between protein localization, structure and conservation for 
90 Class 1 sites on 36 candidate PARKIN substrates (Fig. 4a—d, Sup- 
plementary Fig. 7a, Supplementary Table 5). We found that 90% of 
ubiquitylated lysine residues are conserved in mouse (Mus musculus), 
78% in zebrafish (Danio rerio), and 51% in the 29 conserved ortho- 
logues in D. melanogaster, which harbours an active PARKIN-PINK1 
pathway’*. Moreover, 38% of sites were conserved in all three species, 
and 22 of 36 proteins have at least one site conserved from human to 
D. melanogaster (Fig. 4a—d, Supplementary Fig. 7a). DiGly sites were 
observed on multiple structural elements («-helices, B-sheets and 
loops) and no obvious sequence motif was identified by MOTIFX, 
indicating that PARKIN specificity is driven primarily by substrate 
recruitment and/or proximity rather than specific target sequences. 

Of the 17 MOM targets whose structures or membrane orienta- 
tions are defined, all identified ubiquitylation sites are presented on 
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Figure 4 | Structural anatomy and conservation of PARKIN-dependent 
diGly sites. Structures or domain schematics are shown for Class 1 
depolarization and PARKIN-dependent diGly sites (PDB codes, 
Supplementary Table 5). a, Legend for colour-coded circles that indicate the 
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the cytoplasmic surface (Fig. 4c, d), consistent with the idea that 
PARKIN re-sculpts the MOM proteome via its cytoplasmic recruit- 
ment. Fifty-nine percent of Class 1 and 2 targets had one or two sites 
of ubiquitylation, whereas 41% of targets had 3-15 sites that were 
PARKIN-dependent on the basis of Tier 2 analysis (Supplementary 
Fig. 7f). Of the eight Class 1 and Class 2 sites identified in MFN1, all but 
two are located in helical motifs flanking the two carboxy-terminal 
membrane-spanning regions, indicating localized ubiquitylation (Fig. 2 
and Fig. 4d). We detected 14 Class 1 and 2 sites in HK1, but unlike 
MENI, these sites were comparatively delocalized across both globular 
domains in HK1 (Fig. 4c). Similarly, TOMM70 was decorated exten- 
sively over its 10 cytoplasmic TPR motifs (Figs 2, 4d). Although we 
cannot exclude the possibility that some ubiquitylation events occur 
downstream of PARKIN activation, the fact that we find many of the 
targets as PARKIN-interacting proteins (Fig. 3b) is consistent with 
direct ubiquitylation (see Supplementary Text). An understanding of 
how PARKIN ubiquitylates topologically diverse proteins and domains 
within proteins requires further study but will be facilitated by the site- 
specificity data reported here. 

PARKIN seems to promote rapid turnover of some targets but not 
others. In total, 12 of the Class 1 and 2 proteins identified here (see 
Fig. 2), like MFN1/2 and RHOT1/2'*”""", probably undergo protea- 
somal turnover, including C1QBP1 (Supplementary Fig. 3c, d), FIS1 
and CISD1, as their total levels are also decreased upon PARKIN 
activation’. In contrast, previous studies* have identified 50 preferen- 
tially cytoplasmic proteins or protein complexes that are enriched in 
mitochondria in response to depolarization. Consistent with this, we 
identified 11 of these proteins or components of protein complexes as 
Class 1 or 2 targets (Fig. 2 and Supplementary Fig. 7g), possibly reflect- 
ing an under-appreciated dynamic recruitment process affecting mito- 
chondrial homeostasis. These include the autophagy adaptor p62, 
previously linked with mitochondrial clustering after damage”, as well 
as TBC1D15 and DNMLI, which are known to interact with FIS1 to 
regulate fission—fusion cycles”. In addition to p62, we also identified 
candidate autophagy receptors, CALCOCO2 and TAXI1BP1, which 
are ubiquitylated upon depolarization and associate with PARKIN 
in the presence of BafA (Figs 2, 3). Because CALCOCO2 has been 
shown to target bacteria for autophagy”, it is an attractive candidate 
for involvement in mitophagy. We speculate that these related auto- 
phagy adaptor proteins may be dynamically recruited to the MOM and 
may link the depolarization response to the recruitment of autophagy 
machinery. Additionally, it is possible that additional cytoplasmic 
proteins identified here as PARKIN-dependent targets are transiently 
recruited to mitochondria but below the level of detection in previous 
studies. It is also likely that ubiquitylation of PARKIN targets alters 
their enzymatic properties or functions before their turnover by the 
proteasome or autophagy. Potential PARKIN-regulated functions 
include: (1) fission-fusion cycles (FIS1, TBC1D15), (2) small mole- 
cule transport (VDACs), (3) apoptosis (FKBP8, MCL1, BAX), (4) 
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conservation of Lys at homologous positions in M. musculus, D. rerio and D. 
melanogaster (Supplementary Table 5). For structures, regulated sites are 
shown in red space-filled models. b, Cytoplasmic proteins. RPN10, red circle. 
c, MOM proteins with structures. d, MOM proteins without structures. 
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iron-sulphur shuttling (CISD1), (5) protein translocation (TOMM/70) 
or (6) proteasome assembly or activity (PSMC subunits) (see Sup- 
plementary Text)*°”’. 

This work provides systematic target identification and ubiquityla- 
tion site-specificity for PARKIN, thereby revealing the diversity and 
complexity of PARKIN function on the MOM and in the cytoplasm. 
This resource will enable the development of methods that detect 
specific diGly sites in PARKIN targets in situ, thus enabling a deeper 
understanding of how PARKIN controls mitochondrial homeostasis 
in tissues relevant to Parkinson’s disease. 


METHODS SUMMARY 


Cells were grown in Lys(K)-free DMEM/dialysed fetal bovine serum, with light 
(KO) or heavy (K8) Lys (50 pg ml ')”, and treated with Btz (11M), CCCP 
(101M), and/or BafA (50nM). Mixed cells were lysed in 8M urea, 75mM 
NaCl, 50 mM Tris HCl (pH 8.2), and tryptic/LysC peptides immunoprecipitated 
with anti-diGly-Protein A (Cell Signaling Technologies) (1 h, 4 °C), typically with 
2-4 sequential immunoprecipitations (Supplementary Table 1)'*. Peptides and 
ubiquitylation sites were identified by liquid chromatography-tandem mass spec- 
trometry (see Methods)’’. Extracted ion chromatograms were used for quantifica- 
tion and signal-to-noise measurements’’. Interaction proteomics used 293T or 
HeLa cells stably expressing HA—Flag~PARKIN (or mutants) and CompPASS to 
identify HCIPs”. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Sample preparation for diGly capture. HCT116, HeLa or SH-SY5Y cells (10°) 
were grown in lysine-free DMEM supplemented with 10% dialysed FBS (Gibco), 
2mM t-glutamine, 1% penicillin/streptomycin (Gibco), and light (KO) lysine 
(501gml~'), which contained no heavy isotopes’. Heavy media was the 
same except the light lysine was replaced with heavy (K8, 8 Da heavier) lysine 
(-lysine:2HCl (U-C6, 99%, U-!°N2, 99%); Cambridge Isotopes) at the same 
concentration. Where indicated, cells were treated with Btz (1M), CCCP 
(10M), and/or BafA (50nM) for the times indicated. Btz was a gift from 
Millennium Pharmaceuticals. After the indicated treatments, heavy and light cells 
were mixed 1:1 by cell number and lysed in 8 ml of denaturing lysis buffer (8 M urea, 
50mM Tris (pH 8.2), 75mM NaCl, protease inhibitors (EDTA-free), Roche)'*. 
Samples were incubated on ice for 10 min and then sonicated with 3 X 10s pulses. 
We typically obtained 30-50 mg of total protein. Lysates were digested with trypsin 
as described previously** with one modification. Prior to trypsin digestion, lysates 
were diluted 1:1 with 50 mM Tris (pH 8.2) to lower the urea concentration to 4M 
and digested with 10 ng yl * Lys-C (Wako) for 2h at room temperature. 
Immunoprecipitation of diGly containing peptides. Lyophilized peptides from 
30-50 mg of digested proteins were resuspended in 1.3 ml of IAP buffer (50 mM 
MOPS (pH 7.4), 10 mM Na,HPO,, 50 mM NaCl) and centrifuged at 14,000g for 
5 min to remove any insoluble materials. The supernatant was incubated with 
anti-diGly antibody coupled to protein A agarose or acrylamide beads for 1h at 
4°C and washed with IAP buffer 3x and once with PBS as described previously’’. 
Peptides were eluted by treatment with 50 ul of 5% formic acid for 10 min twice. 
The eluted peptides were desalted using C18 stage-tip method and resuspended in 
5% formic acid before mass spectrometric analysis. Lysates were subjected to 
immunoprecipitation sequentially two or four times, unless otherwise noted in 
Supplementary Table 1. 

Mass spectrometry analysis of diGly peptides. Peptides were separated on 
100 jum X 20cm C18 reversed phase (Maccel C18 3 200A, The Nest Group) 
with a 165 min gradient of 6% to 27% acetonitrile in 0.125% formic acid”. 
The twenty most intense peaks from each full mass spectrometry (MS) scan 
acquired in the Orbitrap Velos (Thermo) were selected for MS/MS (see RAW files 
for specific settings). 

Sequest-based identification using a Human UNIPROT database followed by a 
target decoy-based linear discriminant analysis was used for peptide and protein 
identification as described'**”*'. Localization of diGly sites used a modified 
version of the A-score algorithm” as described’*. A-scores > 13 were considered 
localized. SILAC-based site quantification and signal-to-noise was performed as 
described previously’* by using extracted ion chromatograms. The heavy- and 
light-labelled peptides from each search were subsequently combined using 
custom scripts. Other parameters used for database searching include: 50 p.p.m. 
precursor mass tolerance; 1.0 Da product ion mass tolerance; tryptic digestion 
with up to three missed cleavages; and variable oxidation of Met (+15.994946). 
For an individual peptide to be used for quantification of sites and proteins, it had 
to meet one of two conditions: (1) both heavy and light isotopic envelopes must be 
detected with signal-to-noise ratios above 5.0; and (2) one isotopic envelope (heavy 
or light) must have a signal-to-noise ratio above 10.0. Equal mixing ratios for 
heavy to light cells were confirmed by calculating the mean log,(H:L) ratios in 
non-diGly non-lysine containing peptides and subtracting that from the corres- 
ponding log,(H:L) diGly ratios if necessary. Keratins were removed from site lists. 
Contributions of peptide identifications to various protein isoforms were con- 
densed into a single UniProt descriptor using principles of parsimony. Mass 
spectrometry RAW files are available upon request. 

Interaction proteomics. Interaction proteomics was performed essentially as 
described previously, but with small modifications”. Briefly, 293T or HeLa cells 
were transduced with a lentiviral vector expressing HA-Flag~PARKIN (NP_ 
004553.2) or the designated PARKIN mutants, and stable cell lines selected in 
puromycin. Cells from 4 15cm dishes at 80% confluence were treated with 
CCCP (10 uM), Btz (1M), and/or BafA (50nM) as indicated and after the 
indicated time, cells were collected and lysed in 3ml of 50mM Tris-HCl 
(pH7.4), 150mM NaCl, 0.5% Nonidet P40, and protease inhibitors. Cleared 
lysates were filtered through 0.45—m spin filters (Millipore Ultrafree-CL) and 
immunoprecipitated with 30 pl anti-HA resin (Sigma). Complexes were washed 
4x with lysis buffer, exchanged into PBS for a further three washes, eluted with 
HA peptide, reductively carboxymethylated, and precipitated with 10% trichloro- 
acetic acid (TCA). TCA-precipitated proteins were trypsinized, purified with 
Empore C18 extraction media (3M), and analysed by liquid chromatography- 
tandem mass spectrometry (LC-MS/MS) with a LTQ-Velos linear ion trap mass 
spectrometer (Thermo) with an 18 cm? 125m (ID) C18 column and a 50 min 
8-26% acetonitrile gradient. All AP-MS experiments in 293T cells were performed 
in biological duplicates and for each biological experiment, complexes were ana- 
lysed twice by LC-MS to generate technical duplicates. AP-MS experiments in 
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HeLa cells were performed on a single IP but with technical duplicates. Spectra 
were searched with Sequest against a target-decoy human tryptic UniProt-based 
peptide database, and these results were loaded into the Comparative Proteomics 
Analysis Software Suite (CompPASS) to identify high confidence candidate inter- 
acting proteins (HCIPs)”. Here, a statistics table derived from analogous AP-MS 
data for 166 unrelated proteins was used to determine weighted and normalized 
D-scores (WD*-score) as well as Z-scores based on spectral counts. The D-score 
measures the reproducibility, abundance and frequency of individual proteins 
detected in each individual analysis. 

To identify PARKIN-associated proteins, we filtered proteins at a 2% false 

discovery rate for those with a WD"-score = 1.0, Z-score=5, and average 
assembled peptide spectral matches (APSMs) = 2 in both biological duplicates. 
In addition to the initially identified proteins, APSMs for interactors with WD¥- 
scores greater than 1 under at least one condition examined, or with a Z-score 
greater than 10 in both biological duplicates, were plotted in heat map form using 
Multi-experiment Viewer software (MeV). Due to space limitations, a subset of 
proteins which had a WD%-score=1.0 in at least one experiment were 
omitted from Fig. 3 but are contained in the heat maps in Supplementary Fig. 4 
as well as Supplementary Tables 3 and 4. TAX1BP1 that was identified in PARKIN 
immunoprecipitates and also found to be ubiquitylated but did not pass the 
stringent cut-off for passing our scoring scheme was also displayed in the heat 
map. We note that whereas several proteins passed the WD'-score cut-off 
for PARKIN-associated proteins in the absence of depolarization in indivi- 
dual experiments, there was essentially no overlap within biological duplicates 
(Supplementary Table 4). In addition, for proteins with a maximum of 1-3 spec- 
tral counts, it is possible that those proteins were missed under some conditions 
due to stochastic sampling of low abundance proteins by LC-MS (for example, 
CYB5R3, TOM22, and RHOT1 with wild-type PARKIN at 8h post-CCCP in 
the presence versus the absence of BafA, Fig. 3a). The following proteins were 
identified in only 1 biological duplicate as being associated with PARKIN but were 
included in the heat map (Fig. 3a) as they are also present as Class 1 diGly targets: 
SQSTM1, TAXIBP1, TBC1D15, MYO6, VCP, RHOT2, HK2, MDH2, CYB5R3 
and ACSL4. 
Immunological methods and microscopy. To validate interactions between 
PARKIN and candidate interacting proteins (Supplementary Fig. 3), 293T cells 
stably expressing HA-Flag-PARKIN were either left untreated or treated with 
CCCP (10 4M) or CCCP (10 1M) and Btz (1 uM) for 1h. Extracts (50mM Tris- 
HCl (pH7.4), 150 mM NaCl, 0.5% Nonidet P40, and protease inhibitors) from 
cells were subjected to immunoprecipitation with anti-Flag resin (M2 agarose; 
Sigma), and washed complexes subjected to immunoblotting with the indicated 
antibodies. To examine MFN2 ubiquitylation, extracts from cells lysed in dena- 
turing lysis buffer (8M urea, 75mM NaCl, 50mM Tris (pH 8.0), and protease 
inhibitors) expressing the indicated HA-Flag-PARKIN mutant proteins were 
separated on a 4-12% gradient SDS-polyacrylamide gel (PAGE) and blotted with 
anti-HA or anti-MFN2. To examine PARKIN levels in the cell lines used 
(Supplementary Fig. 1), cells were lysed in denaturing lysis buffer and protease 
inhibitors and extracts subjected to SDS-PAGE and immunoblotting. To examine 
ubiquitylation of candidate PARKIN substrates by immunoblotting, HCT116 cell 
lines stably expressing the candidate substrates C-terminally tagged with HA-Flag 
epitopes were transfected with either control siRNA or an siRNA targeting 
PARKIN (1: 5'-GGAGUGCAGUGCCGUAUUU-3’, 2: 5’-GUAAAGAAGCGU 
ACCAUGA-3’, 3: 5'-GGAGCCAUCGCCUAUGAAA-3’, 4: 5’-GUAAAGAAG 
CGUACCAUGA-3’, Thermo Scientific). After 72 h, cells were treated with CCCP 
(10 1M) with or without Btz to block proteasomal turnover of targets. At the 
indicated times, cells were lysed in 8M urea and subjected to immunoblotting 
using anti-HA antibodies. Identical gels were run and probed with anti-PCNA asa 
loading control. The antibodies used are as follows: anti-TOMM70, (Proteintech, 
14528-1); anti-MFN2, (Epitomics, 3272-1 or Sigma, WH0009927M3); anti- 
ADRM1, (Enzo Scientific, BML-PW9910); anti-Rpn10, (Enzo Scientific, BML- 
PW9250); anti-HK1, (Cell Signaling, C35C4); anti-PSMD2, (Affinity BioReagents, 
PA1-964); anti-PARKIN, (PRK8, Santa Cruz, sc-32282). 

To examine the localization of PARKIN and various PARKIN mutants, HeLa 
cells stably expressing HA-Flag-tagged proteins were plated on No.1 coverslips, 
treated with CCCP (101M, 1h), and fixed with 4% paraformaldehyde before 
immunofluorescence using anti-HA to detect PARKIN proteins and anti- 
TOMM20 to detect mitochondria. All images were collected with a Yokogawa 
CSU-X1 spinning disk confocal on a Nikon Ti-E inverted microscope equipped 
with 100 Plan Apo numerical aperture 1.4 objective lens. HA-PARKIN fluo- 
rescence was excited with the 491 nm line (selected with an AOTF) from Spectral 
Applied Precision LMM-7 solid-state laser launch. Emission was collected with 
a quad band pass polychroic mirror (Semrock) and a Chroma ET525/50m 
emissions filter. TOMM20 fluorescence was excited with the 561 nm line from 
the LMM-7 launch, and emission collected with the Semrock polychroic and a 
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Chroma ET620/60m emission filter. Images were acquired with a Hamamatsu 
ORCA-AG cooled CCD camera controlled with MetaMorph 7 software. Nine 
z-series optical sections were collected with a step size of 0.2 jm, using the inter- 
nal Nikon Ti-E focus motor. z-series were deconvolved using AutoQuant 
blind deconvolution software, and are displayed as maximum z-projections. 
Gamma and brightness were adjusted on displayed images (identically for com- 
pared image sets) and percent colocalization was calculated by thresholding 
for signal at least 4 standard deviations above background in a single z slice 
(identically for compared image sets) using MetaMorph 7 software (Supplemen- 
tary Table 6). 

Structural analysis of ubiquitylation sites and web portal. Using the UniProt 
identifier for each protein identified, the corresponding PDB file(s) (if available) 
were determined using data from the UniProt to PDB cross-reference table (http:// 
www.ebi.ac.uk/pdbe/docs/sifts/quick.html). The amino acid position in the struc- 
ture corresponding to each ubiquitylation site was calculated based on their rela- 
tive position in the structure using the information provided by the cross reference 
table. Data for each experiment was stored in MySQL tables, which are used when 
accessing the data via the web portal. For rendering of the structural information 
via the web portal, Jmol: an open-source Java viewer for chemical structures in 3D 


(http://www.jmol.org/) was used. Access to the web portal is available at harper. 
hms.harvard.edu. 

Gene ontology. Gene ontology analysis was performed using DAVID”. 
Additional sub-cellular localization parameters as designated in Fig. 2 were per- 
formed manually using MITOCARTA, Human Protein Atlas, and GenBank. 
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A conformational switch in HPI releases 
auto-inhibition to drive heterochromatin assembly 


Daniele Canzio'?, Maofu Liao!, Nariman Naber!, Edward Pate*, Adam Larson!*, Shenping Wu!, Diana B. Marina’*, 
Jennifer F. Garcia!*, Hiten D. Madhani', Roger Cooke!, Peter Schuck°, Yifan Cheng! & Geeta J. Narlikar! 


A hallmark of histone H3 lysine 9 (H3K9)-methylated heterochro- 
matin, conserved from the fission yeast Schizosaccharomyces 
pombe to humans, is its ability to spread to adjacent genomic 
regions’ °. Central to heterochromatin spread is heterochromatin 
protein 1 (HP1), which recognizes H3K9-methylated chromatin, 
oligomerizes and forms a versatile platform that participates in 
diverse nuclear functions, ranging from gene silencing to chro- 
mosome segregation’ °. How HP1 proteins assemble on methylated 
nucleosomal templates and how the HP1-nucleosome complex 
achieves functional versatility remain poorly understood. Here we 
show that binding of the key S. pombe HP1 protein, Swi6, to methy- 
lated nucleosomes drives a switch from an auto-inhibited state to a 
spreading-competent state. In the auto-inhibited state, a histone- 
mimic sequence in one Swi6 monomer blocks methyl-mark recog- 
nition by the chromodomain of another monomer. Auto-inhibition 
is relieved by recognition of two template features, the H3K9 methyl 
mark and nucleosomal DNA. Cryo-electron-microscopy-based 
reconstruction of the Swi6-nucleosome complex provides the overall 
architecture of the spreading-competent state in which two unbound 
chromodomain sticky ends appear exposed. Disruption of the switch 
between the auto-inhibited and spreading-competent states disrupts 
heterochromatin assembly and gene silencing in vivo. These findings 
are reminiscent of other conditionally activated polymerization pro- 
cesses, such as actin nucleation, and open up a new class of regulatory 
mechanisms that operate on chromatin in vivo. 

HP1 has two structured domains, a chromodomain (CD) and a 
chromoshadow domain (CSD), connected by an unstructured hinge 
region (Fig. la). The CD recognizes the trimethylated H3K9 (H3K9me3) 
mark’, whereas the CSD can homodimerize'®’ and binds specific 
protein sequences'*™*. The hinge is implicated in sequence-independent 
RNA and DNA binding’*"’. Here we investigate how the key S. pombe 
HP1 protein Swi6 uses its different domains to create a regulatable 
HP1-chromatin complex. 

It is proposed that heterochromatin spread relies on the ability of 
HP! proteins to self-associate on chromatin’’. To understand how 
Swi6 self-association is regulated by chromatin, we first characterized 
the individual oligomerization equilibria in the absence of nucleo- 
somes using analytical ultracentrifugation (AUC). Previous work has 
characterized at least three Swi6 oligomeric states: a monomer, a dimer 
mediated by CSD-CSD interactions and higher-order oligomers 
mediated by CD-CD interactions between dimers'®!*!*"7"*, Analysis 
of our AUC data best describes the system as a two-step self-association 
process: a tight association of two Swi6é monomers with an affinity 
constant, Ke. (iKe™ <1nM, at 8 °C), followed by progressive self- 
association of Swi6 dimers with an identical chain-elongation affinity 
constant, Ke, (1/Kis. ~70 uM, at 8 °C) (dim indicates dimerization, 
iso indicates isodesmic and obs indicates observed; Fig. 1b-d and 
Supplementary Figs 1, 2 and 3). This second step, also known as 


isodesmic self-association, is analogous to the self-association of tubu- 
lin dimers”. 

We next tested whether the most distinguishing feature of the chro- 
matin template, the H3K9 methyl mark, increases Swié oligomeriza- 
tion when it occupies the CD. An increase in oligomerization would be 
reflected by an increase in the overall weighted average sedimentation 
coefficient, s,,, of Swi6 as a function of H3K9me3 peptide (Fig. le). In 
contrast to our simplest expectation, addition of the methylated 
peptide reduced the value of s,,, implying that Swi6 self-association 
is inhibited by the methylated H3 tail peptide (Fig. le). This result 
suggested that the methylated H3 tail peptide and the CD-CD inter- 
face may compete for the same site. We noticed that the CD of Swi6 
contains a sequence (92-Ala-Arg-Lys-Gly-Gly-Gly-97) on a loop that 
resembles the amino acid sequence of the H3 tail surrounding the 
Lys 9 position (7-Ala-Arg-Lys-Ser-Thr-Gly-12) (Fig. 1f). Interestingly, 
although the Swi6 sequence degenerates in higher organisms to just the 
lysine and proximal glycine (Fig. 1f), in human HP1 isoforms the lysine 
shows post-translational modifications found on H3K9, such as mono- 
methylation and acetylation”. We therefore proposed that the ARK 
loop from the CD of one Swi6é could occupy the H3K9-binding site in 
another CD to mediate CD-CD self-association in solution (Fig. 1g). 
This is reminiscent of observations that the HP1 CD can bind ARK- 
containing motifs in histone H1 and G9a proteins*!”’. 

To test this model, we investigated the effects of replacing the Arg 93 
and the Lys 94 residues with alanines (Swi6'°**; Fig. 1g and Sup- 
plementary Table 1) on oligomerization. As predicted by the model, 
the Swi6’°°PX mutant showed a small but reproducible (threefold) 
decrease in the isodesmic affinity constant (Kis?) (Fig. 2a and Sup- 
plementary Fig. 4). Interestingly, we noticed a substantially larger 
(14-fold) reduction in the association constant for dimerization (Ko) 
(Fig. 2b and Supplementary Fig. 4). Thus, in addition to the previously 
identified CSD-CSD interface, the ARK loop-CD interaction also parti- 
cipates in stabilizing a Swi6 dimer. We further found that Swi6'°°?* 
binds tail peptides approximately sixfold more strongly than wild-type 
Swi6 (Swi6” ’) (Fig. 2d), and that Swi6 dimerization is weakened with 
saturating methylated H3 tail peptide (Supplementary Fig. 4). These 
results indicate that the ARK loop-CD interaction is mutually exclu- 
sive with H3 tail binding. 

The above data suggest that a Swié dimer can exist in at least two 
states: a closed state in which the ARK loop engages the CD of its 
partner Swi6 and an open state in which the ARK loop-CD interaction 
is broken (Fig. 2c). Self-association of dimers then consists of: (1) a 
conformational step between closed and open states (K°"); and (2) a 
self-association step between dimers in the open state (K°!8°). For 
Swi6’’ the measured isodesmic association step (Kis°) is a product 
of K™ and K*8° (Fig. 2c). In Swi6'°°P the effect on dimerization 
masks the destabilizing effect of the loop mutations on the actual 
oligomerization step (Kelso) (Fig. 2c). 
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Figure 1 | Dissecting Swi6 self-association equilibria. a, Swi6 domains. H, 
hinge; N, N-terminal region. b, Sedimentation equilibrium AUC analysis of 
Swi6™” self-association. Interference profiles at different rotor speeds are 
shown. Every twentieth point is shown. c, Sedimentation velocity AUC analysis 
of Swi6" self-association. T, temperature. For b and c, best fits for global 
analysis using an isodesmic self-association model are shown. d, Model of Swi6 
self-association. S, Swi6 monomer. e, [Swi6”"] = 20 uM. Sedimentation 
coefficients: dimer (S.) = ~4S; tetramer (Sy) = ~5.2S. f, Top, Swi6 CD 
modelled on Drosophila HP1 CD with H3K9me3 peptide (Protein Data Bank 
(PDB) code, 1KNE). Bottom, H3 tail (amino acids 4-14) and CD loop regions 
of Swi6 (amino acids 72-97), Drosophila (d) HP1« and human (h) HP1o and 
HP1. Conserved lysine in red. g, Top, models for CD-CD and CD-ARK loop 
interactions. H3 tail, green line; methylation, red circle. Middle, schematic of 
Swi6-H3 tail interactions (left) and of hypothetical CD-CD interactions 
(right). Grey oval, region of negative charge; brown oval, m-cation interactions. 
Bottom, mutants used. 


To investigate the extent of similarity between the ARK loop—CD 
interaction and the H3-CD interaction, we used two additional 
mutants. The first is Swi6 28%, in which an aromatic cage residue 
important for H3K9me3 binding* is mutated to alanine (Fig. 1g and 
Supplementary Table 1). The second is Swi6*'% in which an acidic 
stretch amino-terminal to the first aromatic cage residue of the CD is 
mutated to alanines (Fig. 1g and Supplementary Table 1; rationale in 
Supplementary Discussion). Both mutants show reduced binding to 
H3K9me3 peptides (Fig. 2d). These mutants also destabilize Swi6 
oligomerization and dimerization (Fig. 2a, b and Supplementary 
Fig. 4), suggesting that similar interactions are involved in the H3- 
CD and ARK loop-CD interfaces (Fig. 1g). 

We next used electron paramagnetic resonance (EPR) spectroscopy 
to ask whether disruption of the ARK loop-CD interface or binding of 
the H3K9me3 tail makes the loop more mobile by stabilizing the open 
conformation (Fig. 2c, e). Changes in the mobility of a site-specifically 
attached spin probe can give well-defined changes in its EPR spectrum”’. 
We mutated all three native cysteines in Swi6 to serines (Swi6?>; 
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Supplementary Table 1), mutated the Gly 95 residue on the loop to a 
cysteine, and then modified it with a maleimide spin probe (SwiePr’e; 
Supplementary Table 1). Mutating the native cysteines destabilized 
oligomerization, H3 peptide binding and nucleosome binding (Sup- 
plementary Fig. 5), but the mutants still showed substantial discrim- 
ination for the H3K9 methyl mark (Supplementary Fig. 5). 

Two spectral components were observed for Swi6?°?* “7, one with 
higher mobility and one with reduced mobility. Deconvolution of 
the two components gave the fraction of probes that are immobilized. 
In parallel, AUC experiments confirmed the oligomeric state of the 
protein. For the Swi6?t°?* 7 protein, ~35% of the probes were 
immobile (Fig. 2f). Compared to Swi6P'°’*-W7 the fraction of im- 
mobile probes decreased in SwiGProbeLoopX GwigProbe-AcidicX and 
Swi6P™ePeP™meX (L315D; Supplementary Table 1), which disrupts 
CSD-CSD dimerization and increases monomeric Swi6 (ref. 18) 
(Fig. 2f and Supplementary Fig. 5). The values obtained for the 
mutants relative to wild type are consistent with our thermodynamic 
characterization (Fig. 2a, b). Further, as predicted by the model 
(Fig. 2c, e), addition of the H3K9me3 peptide decreased the fraction 
of immobile probes. The H3K9me3 peptide was ~100-fold better at 
decreasing the immobile probe fraction compared to both H3K9 and 
H3K4me3 peptides, indicating that the effect was specific for the 
H3K9me3 mark (Fig. 2g; additional mutants in Supplementary Fig. 5). 

To investigate the global structure of Swi6 dimers, we used negative- 
stain electron microscopy. To increase the mass for visualization by 
electron microscopy, and to identify the N terminus of Swi6, we fused a 
cyan fluorescent protein (CFP) molecule to the N terminus of Swi6é 
(Fig. 3a and Supplementary Fig. 6). The CFP-Swi6 construct showed 
an extended conformation (Fig. 3a). We reasoned that the proximity of 
the CFP tag to the CD perhaps disrupts the ARK loop-CD interaction. 
Consistent with this reasoning, CFP-Swi6 forms a ~tenfold weaker 
dimer than Swi6”” (Fig. 2b and Supplementary Fig. 6). To maintain 
the ARK loop-CD interaction, we moved the CFP tag to the carboxy 
terminus (Fig. 3b). Swi6-CFP has a similar dimerization constant to 
Swi6™", consistent with having an intact ARK loop-CD interaction 
(Supplementary Fig. 6), shows a more condensed structure compared 
to CFP—-Swi6 and has a lower sedimentation coefficient (Fig. 3b and 
Supplementary Fig. 6). These results raised the possibility that the 
extended conformation of CFP-Swi6 reflects the open state (Fig. 2c), 
which is capable of binding methylated nucleosomes. We therefore 
visualized Swi6 bound to a methylated nucleosome using cryo-elec- 
tron microscopy, and for comparison, visualized nucleosomes alone 
(Fig. 3c). On the basis of our previous biochemical knowledge, we 
applied two-fold symmetry to the Swi6-nucleosome complex to obtain 
the three-dimensional reconstruction (see also Supplementary Methods). 
For the nucleosome and the Swi6-nucleosome complexes, three- 
dimensional reconstructions were calculated using the nucleosome 
structure as an initial model to an overall resolution of ~15A and 
~25 A, respectively (Fig. 3c, Supplementary Fig. 7 and Methods). 

The 25A resolution of the Swi6-nucleosome complex precludes 
conclusions about the detailed conformations of the bound Swi6 
dimers. We instead further analysed the difference density between 
the complex and nucleosome (Fig. 3c). Although we cannot rule out 
that Swi6 binding alters nucleosome conformation, the difference den- 
sity has roughly the mass (~125 kDa) of two Swi6é dimers (~150 kDa) 
as determined previously’®. We thus assume that the difference density 
is mainly contributed by the bound Swi6é dimers. The putative location 
of the CD suggests that one CD engages an H3 tail and one CD 
protrudes out in solution (Fig. 3d). This arrangement of the CDs is 
compatible with the sticky ends architecture proposed previously’* 
(Fig. 3e). The putative location of the CSD dimer suggests that this 
domain may also engage the nucleosome (Fig. 3c, d). This possibility 
has also been previously suggested***®. To test it directly, we measured 
binding of the Swi6”"*™ to methylated nucleosomes, and observed 
that disruption of the CSD dimer decreases binding by tenfold (Fig. 4a 
and Supplementary Fig. 8). 
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Figure 2 | Impact of disrupting H3 tail mimic—CD interaction. 

a, b, Isodesmic association constant (Kio (a) and dimerization association 
constant (K¢™) (b) for Swié mutants (values in Supplementary Fig. 4). c, Model 
for the self-association of Swi6. (K°°™ = [open]/{closed]). K°!8°is the isodesmic 
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Figure 3 | Electron microscopy studies of Swi6 and Swi6-H3K-9me3 
nucleosome complex. a, b, CFP—Swi6 (a) and Swi6-CFP (b). Representative 
two-dimensional class averages are shown. c, Two different views of three- 
dimensional reconstruction of the Swi6-H3K-9me3 (Kc indicates derivation of 
this analogue by alkylation of a cysteine residue, see Methods) nucleosome 
complex (left), nucleosome (middle) and difference map between the two 
reconstructions (right). Nucleosome crystal structure (PDB code, 1KX5) was 


Swi6-nucleosome complex 


Molar mass (kDa) 


fluorescence anisotropy (bottom) studies (values in Supplementary Fig. 8). 

e, Location of MSL (4-maleimido-2,2,6,6-tetramethyl-1-piperidinyloxy) probe 
on G95C (yellow circle). f, Sedimentation velocity AUC (left panels) and EPR 
analyses (right panels) of Swi6P®°P* “7 and Swi6PrP* +°°P*, Representative 
EPR spectra shown as derivative of absorbance (y axis) versus magnetic field (x 
axis). c(M), molar mass distribution. Errors for probe immobilized <10%. 

g, Impact of 18-nucleotide H3 peptides on probe immobilization. 
[Swi6P?°>-WT) = 20 uM. For all panels, error bars represent s.e.m. (n = 3). 


Nucleosome Difference density 


fitted into reconstruction. Isosurface of nucleosome three-dimensional 
reconstruction at high threshold in dark blue and low threshold in light blue 
(nucleosome type used in Methods). H3, red; difference map, yellow. 

d, Putative locations of Swi6 domains docked into difference map: CD of Swi6é 
(black; PDB code, 2RSO; amino acids 72-142), CSD domain of Swi6 (red; PDB 
code, 1EOB) and hinge (H). e, Proposed locations of the two unoccupied CDs. 
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Surprisingly, in contrast to the results with the H3 tail peptides 
(Fig. 2d), disrupting auto-inhibition through the Swi6'°°* mutant 
reduced binding to methylated nucleosomes by tenfold, even though 
discrimination for the methyl mark was maintained (Fig. 4a and 
Supplementary Fig. 8). This suggested that, when displaced from the 
CD, the ARK loop may help Swi6 make additional interactions with 
the nucleosome. We tested whether the positively charged ARK loop 
assists interactions with DNA. We found that Swi6™ " binds ~fourfold 
tighter than Swi6'°P* to a 20-base pair DNA duplex (Fig. 4b). Fur- 
ther, Swi6* bound to the DNA ~fourfold tighter with saturating 
H3K9me3 peptide, consistent with the loop being available when dis- 
placed from the CD (Fig. 4b and Supplementary Fig. 8). In contrast to 
Swi6PX, Swi6 “4X, which binds H3K9me3 peptides more weakly 
than Swi6™", also binds methylated nucleosomes ~sevenfold more 
weakly (data not shown). 

On the basis of the above data, we propose that binding to methy- 
lated nucleosomes has two coupled effects: (1) release of ARK loops to 
directly or indirectly help DNA binding; and (2) release of two CDs 
that can bridge nearby nucleosomes (Fig. 4c). This new model revises 
our previously proposed model’* (see Supplementary Discussion). Our 
data imply that the cooperative action of the CD, the CSD-CSD dimer 
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Figure 4 | Nucleosome recognition and in vivo impact of disrupting loop- 
CD interaction. a, b, Nucleosome binding assayed by fluorescence anisotropy 
(a) and affinity constants for 20-base-pair DNA (b). Kg values are in 
Supplementary Fig. 8. Error bars represent s.e.m. (” = 3). mP, millipolarization 
units. c, Model for conformational switch in Swi6é upon binding methylated 
nucleosomes. d, Top, schematics of centromere 1 showing ura4? reporter. cnt, 
centromere; L, left; otr, outer-most repeats; R, right. Bottom, silencing assay 
using ura4* reporter. e, swi6k?P* and swi64“'* mutants decrease H3K9 
methylation levels at the centromeric dg on a dcr1A background. Error bars 
denote s.e.m from three independent immunoprecipitations. f, Model depicts 
how conformational versatility of the HP1-chromatin platform enables 
recruitment of diverse regulators that promote (yellow, red, blue and green 
cartoons) or inhibit (grey cartoon) heterochromatin spread. 
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and the ARK loop couples the assembly of Swi6 to the recognition of 
specific features of the nucleosomal template such as H3K9 methyla- 
tion and nucleosomal DNA. This coupling can ensure correct targeting 
to H3K9-methylated chromatin and reduce aberrant spread in euchro- 
matin. Interestingly, the loop that stabilizes the auto-inhibited state 
assists in binding nucleosomes when in the open state. These mutually 
exclusive roles of the loop may enable switch-like behaviour in HP1 
spreading. 

To test the relevance of this model in vivo, we investigated the 
impact of the LoopX and AcidicX mutants in assembling a functional 
heterochromatin structure. As these mutants concomitantly weaken 
oligomerization and nucleosome binding, we expected to observe loss- 
of-function effects in vivo. We first investigated effects on the silencing 
of a ura4* reporter gene inserted at the pericentromeric imr (inner- 
most repeat) region (Fig. 4d). Both mutants show defects in silencing 
that are comparable to the swi6™ deletion strain and that are not due to 
reductions in protein levels (Fig. 4d and Supplementary Fig. 9). Next, 
we investigated effects at endogenous centromeric dg repeats. In the 
absence of the RNA interference (RNAi) machinery, Swi6 is important 
for maintaining high levels of H3K9 methylation at dg repeats”’. 
Although deletion of RNAi components causes a small but repro- 
ducible decrease in H3K9 methylation, further deletion of swi6 causes 
a much larger decrease in H3K9 methylation”’. We find that the loopX 
and acidicX mutants also show large decreases in H3K9 methylation 
in the absence of an RNAi component such as dicer (dcr1) (Fig. 4e and 
Supplementary Fig. 9). These data imply that the ARK loop-CD inter- 
action is important for the integrity of H3K9-methylated heterochro- 
matin in vivo. Our results with the LoopX and AcidicX mutants are 
also consistent with previous work showing that mutations in these 
regions affect mitotic stability and mating type switching’’, both of 
which depend on the integrity of heterochromatin. 

The ability of Swi6 to exist in more than one discrete conformational 
state may allow it to interact with different regulators through the 
CSD-CSD interface, the hinge or the ARK loop, and this could alter 
the stability and structure of the Swi6-nucleosome platform (Fig. 4f). 
The Ala-Arg-Lys-Gly-Gly-Gly sequence is absent in the other S. pombe 
HP1 protein, Chp2, and this difference may in part explain the differ- 
ent biological roles of Chp2 and Swi6 (ref. 28). In Swi6, the ARK loop 
stabilizes the auto-inhibited state even though the lysine is not methy- 
lated, presumably due to the high effective concentration of the ARK 
loop relative to its partner CD. However, human HP1« contains just 
the Lys-Gly residues of the Ala-Arg-Lys-Gly-Gly-Gly sequence and, in 
this context, the lysine can be monomethylated in vivo”. It is tempting 
to speculate that the methylation energetically compensates for the loss 
of the arginine while also making the interaction more regulatable. 
Protein assemblies that are controlled by release of auto-inhibition 
have been well characterized in processes such as actin nucleation 
and protein tyrosine kinase activation””°. We anticipate that similarly 
sophisticated mechanisms govern the assembly, spread and functions 
of HP1-mediated heterochromatin. 


METHODS SUMMARY 


Swi6 was purified from Escherichia coli as described previously'*. Except where 
specified, all experiments were performed in 20 mM HEPES, pH 7.5, 150 mM KCl 
and 1 mM dithiothreitol. Nucleosomes were prepared using recombinant Xenopus 
laevis histones with and without a methyl lysine analogue at H3K9. Silencing 
assays were performed at 30°C for 2-3 days. The Ab1220 (Abcam) was used 
for H3K9me2 chromatin immunoprecipitation. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Protein cloning and purification. Swi6 proteins were purified from Escherichia 
coli as described previously’*. Except for the CFP-tagged proteins, all other Swi6é 
protein purifications yield final proteins that are devoid of N- or C-terminal tags. 
Protein concentrations of all Swi6 construct samples were measured by ultraviolet 
absorption at 280 nm and calculated using the experimentally determined extinc- 
tion coefficient (see AUC section). To ensure that there was minimal DNA con- 
tamination, we measured the 260/280 ratio for every purified protein, which on 
average was ~0.5. 

Reaction buffer conditions. Except where specified, all experiments were per- 
formed in the reaction buffer consisting of 20 mM HEPES, pH 7.5, 150 mM KCl 
and 1 mM dithiothreitol. 

Nucleosome assembly. Core nucleosomes were assembled on 147 base pairs (bp) 
of DNA using the 601 positioning sequence, containing a Pst1 site 18 bp in from 
the 5’ end. For the cryo-electron microscopy of nucleosome-alone studies, 207 bp 
of DNA containing the 601 sequence at one end was used. All nucleosomes were 
prepared using recombinant Xenopus laevis histones and assembled as described 
previously’'. Methyl lysine analogue containing H3 histones at position 9 
(H3Kc9me3) were prepared as described previously”. 

Tryptophan fluorescence studies. The association between Swi6 proteins and the 
H3 peptides (amino acids 1-18) were measured following the increases in the 
internal fluorescence of Trp 104 (one of the three residues in the aromatic cage) 
using an ISS K2 fluorimeter at 30 °C. Samples containing 200 nM Swié6 in reaction 
buffer were mixed with increasing concentrations of each H3 peptide, trimethy- 
lated or unmethylated at lysine 9. After an incubation for 10 min at 30 °C, the 
fluorescence of Trp 104 was measured with the incident wavelength of 295 nm. 
The observed fluorescence intensity (Fops) at 330 nm was plotted as a function of 
peptide concentration. A 1:1 binding model was fit to the data using GraphPad 
Prism and the following set of equations: 


(Finax [H3,] + FminKa) 
((H3,] Tv Ka) 


F, ‘obs 


in which Fynax is the fluorescence at saturating peptide, Fmin is the fluorescence in the 
absence of peptide and H3, represents the H3 tail peptide. The obtained dissoci- 
ation constant (K4) values were averaged over three independent sets of data. 
Fluorescence polarization studies. Fluorescence polarization-based measure- 
ments of binding to H3 tail peptides (amino acids 1-15), DNA and nucleosomes 
were performed in reaction buffer with 0.01% NP-40 at 24 °C. In total, 5-10 nM of 
peptide, DNA or nucleosomes were used and Swi6 concentrations were varied. 
The binding reaction was incubated for 30 min at 24°C and fluorescence polari- 
zation was measured using an Analyst HT (Molecular Devices) with excitation and 
emission wavelengths of /., = 480nm and A... = 530 nm, respectively. The H3 
peptide was labelled at the N terminus with a fluorescein probe (FAM). The 
peptide was synthesized by GenScript. The 20-base-pair DNA used in the DNA- 
binding assay was 5’-labelled with 5,6 carboxy-fluorescein (IDT). The DNA to 
assemble fluorescent nucleosomes was labelled on one end by amplifying the 
sequence using PCR with a primer covalently linked to 6-carboxyfluorescein by 
a 6-carbon linker (IDT). All data were analysed using GraphPad Prism. 
The peptide and DNA-binding data were fit by the following equation: 


(Fimax [Swi6] + FninKa) 
([Swi6] + Ka) 


In which F,, is the fluorescence polarization signal observed, Fynin is the fluor- 
escence polarization signal for the probe alone (peptide or DNA) and Fyyax is the 
fluorescence polarization signal at saturating [Swi6]. The obtained Kg values were 
averaged over three or more independent sets of data. 

The following model was used to fit the nucleosome-binding data to account for 
Swi6-Swi6 oligomerization that is scaffolded by the Swié-nucleosome complex. 
Because the fluorescent probe is located only on one end of the DNA (asterisks), 
we made the assumption that changes in fluorescence polarization reflect binding 
of Swi6 on one side of the nucleosome. We proposed that binding to the other side 
occurs independently and is invisible to our assay: 


Fobs 


D+D+D+D+N == D+D+D+D-N 


| = | 


D+D+D+N-D === D+D+D-N-D ==> DD-N-DD 
K’ 
d d 


In which D indicates a Swi6 dimer, D-N-D is the Swi6-nucleosome complex 
and DD-N-DD is the Swi6-nucleosome complex bound by additional Swi6é 


dimers. The fluorescence polarization nucleosome data were fitted using the 
following equation: 
(FoKj Kj + F, [Swi6] Kj + F, [Swi6]*) 


Fobs 1p 7 2 TAR. 
(K}K? + [Swi6]K3 + [Swi6]”) 


In which F,,, is the fluorescence polarization signal observed, Fo is the fluorescence 
polarization signal for the nucleosome alone, F, is the fluorescence polari- 
zation signal of the saturated Swi6-nucleosome complex, F, is the fluorescence 
polarization signal due to the oligomerization of Swi6 scaffolded by the Swi6- 
nucleosome complex, Kj is the dissociation constant for the Swi6-nucleosome 
complex and Kj is the dissociation constant for Swi6-Swi6 scaffolded by the 
Swi6-nucleosome complex. Data comparing Swi6™ to the mutants were globally 
analysed: the Fo, F, and F, were fixed among all proteins whereas the Kj and K} 
were floated. 

AUC studies. Swi6 proteins were individually dialysed into reaction buffer 
overnight. Swi6 proteins were quantified by ultraviolet absorption at 280 nm. 
We experimentally determined the Swi6”” extinction coefficient by recording 
both interference fringes and ultraviolet absorbance at 280 nm for a given Swi6 
sample. We then converted the number of interference fringes observed into 
milligrams of Swié per millilitre using an average refractive increment of 
4.1 fringesmg ‘ml. Using this estimated concentration and the absorbance 
value at 280nm, we then calculated the extinction coefficient at 280nm to be 
36,880 M_'cm™'. Simultaneous detection of protein by ultraviolet absorbance 
at multiple wavelengths allowed for the determination of the extinction coefficients 
at 230 nm and 250 nm (13,650 M~'cm™! and 221000M_' cm’, respectively). 

All sedimentation experiments were conducted using an analytical ultra- 
centrifuge (Beckman Coulter) equipped with either sole absorption optical scan- 
ner (Optima XLA) or both absorption and interference optics scanner (Optima 
XLI). Data were acquired with ProteomeLab data acquisition software 5. 
Global analysis of sedimentation equilibrium and sedimentation velocity iso- 
therm data was performed using the SEDPHAT software. Error estimates were 
calculated on the basis of replicates of three or more experiments and confidence 
intervals based on F-statistics and the error projection method. Partial-specific 
volume (v), solution density (9) and solution viscosity (7) were calculated in 
SEDNTERP. 

Sedimentation equilibrium. Sedimentation equilibrium experiments were con- 
ducted at 8°C in an Optima XLI/A at rotor speeds of 6,000, 11,000 and 18,000 
r.p.m. in double-sector centrepieces with sample volumes of 170 pil. Loading con- 
centration of Swi6, in monomer units, was varied from 1.7HM to 32 uM. 
Absorbance data, at wavelengths of 280, 250 and 230 nm, and immunofluores- 
cence data were acquired from samples at five different loading concentrations at 
all rotor speeds. Global analysis of data at different wavelengths and rotor speeds 
was conducted with the software SEDPHAT, using Boltzmann exponentials repre- 
senting the predicted concentration profiles of each species in chemical equilib- 
rium, with amplitudes at all radii constrained by the mass action law: 


Cot = C1 +nKet + S2* , (n+im)Ker (Ler)! 


in combination with the method of implicit mass conservation, using the bottom 
position of each solution column as an adjustable parameter. In the above equa- 
tion, Cror is the total protein concentration, c, is the concentration of Swi6 
monomer, n is the number of Swi6 subunits that self-associate in the first-step 
of association (n = 2 for dimer formation), m is the molecularity of the chain- 
elongation unit (m = 2 for a dimeric chain-elongation unit), K is the association 
constant for Swi6 dimerization (Ki), L is the association constant for Swi6 
isodesmic chain elongation (Ki?) and i is the number of units of m added to 
the initial complex formed by self-association of n Swié subunits. Summation of 
terms was carried out to a relative numerical precision of 10°. 

Sedimentation velocity. Samples volumes of 400 il at an overall final optical 
density (OD) between 0.1 and 1.0, were pipetted into double-sector centrepieces 
and inserted in an eight-hole rotor, which was placed in the temperature pre- 
equilibrated AUC chamber. An additional incubation period of 1-2h was added 
with the rotor at rest and under vacuum for temperature equilibration. For experi- 
ments performed at 4°C, the samples were left equilibrating under vacuum 
overnight. Runs were performed at a rotor speed of 50,000 r.p.m. for more than 
12h. Scans were collected following ultraviolet absorption at 230, 250 and 280 nm, 
scanned with a radial step size of 0.003 cm in continuous mode, and/or using the 
interference system. Data were analysed using a c(s) continuous distribution of 
Lamm equation solutions with the software SEDFIT, followed by integration and 
assembly into an isotherm of weighted-average s-values. The isotherm was 
modelled in SEDPHAT with mass-action-based models for the weighted-average 
s-value 
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assuming a power law for the sedimentation coefficients of oligomeric species with 
Kk = 0.566 (consistent with increasingly elongated oligomers; this value was pre- 
determined from the global fit of sedimentation equilibrium and sedimentation 
velocity on an extensive data set), in combination with an overall hydrodynamic 
non-ideality term of magnitude k, = 0.01 ml gt. As in the equation above, ctor is 
the total protein concentration, c, is the concentration of Swié monomer, n is the 
number of Swi6 subunits that self-associate in the first-step of association (n = 2 
for dimer formation), m is the molecularity of the chain elongation unit (m = 2 for 
a dimeric chain elongation unit), K is the association constant for Swi6 dimeriza- 
tion (K2™), L is the association constant for Swié isodesmic chain elongation 
(Ki82), s; is the sedimentation coefficient of Swi6 monomer and s,, is the sedi- 
mentation coefficient of Swi6é dimer (n = 2). 

Rationale for different temperatures. (1) To obtain a model for Swi6 self- 

association we performed sedimentation equilibrium and sedimentation velocity 
AUC studies at 8 °C. Sedimentation equilibrium experiments are ~1 week long, so 
a temperature of 8 °C was used to stabilize the protein. Global analysis of both 
sedimentation equilibrium and sedimentation velocity AUC at 8 °C allowed us to 
obtain a thermodynamic information for Swi6 self-association as well as hydro- 
dynamic parameters for Swié6 monomer, dimer and oligomers that were used in all 
the sedimentation velocity experiments performed at higher temperatures. (2) To 
compare dimerization properties between Swi6 mutants, we had to perform the 
experiments at 30 °C because dimerization is too tight at lower temperatures. (3) 
To stabilize probe-labelled Swi6 proteins, the EPR and AUC experiments were 
done at 4 °C. 
EPR studies. EPR measurements were performed with an EMX EPR spectrometer 
(Bruker Instruments). First derivative, X-band spectra were recorded in a high- 
sensitivity microwave cavity using 50-s, 100-G wide magnetic field sweeps. The 
instrument settings were as follows: microwave power, 25 mW; time constant, 
164ms; frequency, 9.83GHz; modulation,1G at a frequency of 100kHz. 
Each spectrum used in the data analysis was an average of 10-40 50-s sweeps 
from an individual experimental preparation. Swi6** was labelled by reacting 
the sole cysteine residue (either K94C or G95C) with the EPR probe 4-malei- 
mido-2,2,6,6-tetramethyl-1-piperidinyloxy (MSL, Sigma Aldrich). The protein 
was first dialysed overnight in reaction buffer without dithiothreitol. It was 
then incubated with MSL using a twofold molar excess of MSL to protein con- 
centration. The mixture was then left to react for 4h at 4°C. The excess label 
was removed by a microcon concentrator, followed by an additional overnight 
dialysis step into the above buffer. The protein sample was incorporated into a 
25-ul capillary and the EPR spectrum was recorded. The temperature of the 
sample was controlled by blowing dry air (warm or cool) into the cavity and 
monitored using a thermistor placed close to the experimental sample. To stabilize 
the Swi6?™°°*" and Swi6?*°’* mutants we performed the EPR and AUC experi- 
ments at 4 °C. 

The spectra were deconvoluted into mobile and immobile spectral components 

using the protocols described in ref. 34. 
Electron microscopy and image processing. Negative-stain electron microscopy 
of CFP-Swié and Swi6-CFP. Proteins were dialysed overnight in reaction buffer. 
In total, 2.5 pl of CFP-Swi6 at 0.34 uM and of Swi6-CFP at 0.1 11M was absorbed to 
a glow-discharged copper grid coated with carbon film for 30s followed by con- 
ventional negative stain with 0.75% uranyl formate. Images were collected using 
a Tecnai T12 microscope (FEI Company) with a LaBg filament and operated at 
120-kV accelerating voltage. All images were recorded at a magnification of 
X 67,000 with an UltraScan 4096 X 4096 pixel CCD camera (Gatan). 

All images were 2 X 2 pixel binned to the final pixel size of 3.46 A before any 

further processing. A total of 5,000 and 3,000 particles for CFP-Swi6 and Swi6- 
CEP, respectively were selected from ~50 images using the display program 
SAMViewer (written by M. Liao). All subsequent image processing was performed 
using Spider* and Frealign. 
Cryo-electron microscopy studies of the nucleosome and Swi6-nucleosome 
complex. Cryo-electron microscopy data were collected using Tecnai TF20 elec- 
tron microscope equipped with a field emission gun (FEI Company) and operated 
at 120 kV (for the nucleosome) or at 200 kV (for the Swi6-nucleosome complex). 
Images were collected at a nominal magnification of X62, 000 using a TemF816 
8K X 8K CMOS camera (TVIS). 

Nucleosome alone. The nucleosome contained 60 bp of flanking DNA (147 bp 
of 601 sequence + 60 bp extra DNA) and did not contain the methyl lysine ana- 
logue on H3K9. All images were binned by a factor of 2 (2.39 A/pixel) for further 
processing. Defocus values were determined for each micrograph using CTFFIND”” 
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and ranged from —1.5,1m to —3 um. A total of 13,629 particles were selected and 
classified into 100 two-dimensional class averages. Three-dimensional reconstruc- 
tions were calculated and refined using GeEFREALIGN”*. The initial model was 
generated by filtering the atomic structure of the nucleosome (PDB code, 1KX5) 
to 35 A (command pdb2mrc from EMAN package)**. The resolution was estimated 
to be ~16.5 A, on the basis of Fourier shell correlation (ESC) = 0.5 criteria. 

Swi6-H3K-9me3 nucleosome complex. Swi6 was dialysed overnight in reac- 
tion buffer. The binding reaction was set such that, first, both nucleosome and 
Swi6 concentrations were above the Kq value measured by fluorescence polariza- 
tion (FP) and second, the Swi6 concentration was sufficient to titrate all the 
nucleosomes as assayed by native gel shift. Those same conditions were used 
previously to measure the stoichiometry of the complex by sedimentation velocity 
AUC and are known to result in homogenous samples'*. 

A total of 5,000 particles were selected and classified into 200 two-dimensional 
class averages and all were included in the final three-dimensional reconstruction. 
The cryo-electron microscopy three-dimensional reconstruction of the nucleo- 
some alone was low-pass filtered to 35 A and used as the initial model for three- 
dimensional refinement of the complex. We used our previous biochemical 
knowledge to guide the structural analysis. We have previously shown that the 
complex of Swié with an H3K9 methylated nucleosome contains two Swi6é 
dimers'*. Given the pseudo-two-fold symmetry in the positions of the H3 tails, 
the simplest model posits that the Swié dimers also bind in a pseudo-symmetric 
manner with one dimer on either side of the nucleosome. Indeed in some of the 
two-dimensional class averages we observe density on either side of the nucleo- 
some consistent with the predictions of the biochemical analysis (Supplementary 
Fig. 7). We therefore applied two-fold symmetry to obtain the three-dimensional 
reconstruction. The resolution of the final three-dimensional reconstruction was 
estimated to be ~25 A, on the basis of FSC = 0.5 criteria. This same resolution was 
also obtained when the cryo-electron microscopy three-dimensional reconstruc- 
tion of nucleosome alone was low-pass filtered to 60 A. 

All three-dimensional reconstructions were visualized by UCSF Chimera. The 
‘Fit in Map’ function of Chimera was used to dock the atomic structure of the 
nucleosome (PDB code, 1KX5) into the three-dimensional volume”. 

To calculate the difference map, the nucleosome alone and the Swi6- 
nucleosome complex maps were low-pass filtered to 25 A. The difference map 
was calculated by subtracting the nucleosome from the Swi6-nucleosome complex 
using the program diffmap.exe (provided by N. Grigorieff), which normalizes 
the density maps before calculating the difference map. The extended shape of 
the difference density is compatible with the shape of the Swi6 dimer visualized 
in the negatively stained CFP-Swié dimer (Fig. 3a). Such similarity suggested a 
model for the arrangement of the individual domains of Swié and enabled us to 
manually place the known crystal structures of the CD and CSD into the difference 
density (Fig. 3d). 

Silencing assays. The strains were grown overnight to saturation and diluted to 
OD600 nm Of 1 at the highest dilution. Serial dilutions were performed with dilution 
factor of 5 and cells were grown on non-selective (YS)- and 5-FOA (2 gl! 
5-fluoroorotic acid)-containing media for ura4* reporter at 30°C for 2-3 days. 
Quantifying Swi6é protein levels in vivo. Swi6 protein levels were quantified 
using polyclonal antibodies raised in rabbits by injecting recombinant Swi6. 
Chromatin immunoprecipitation. The chromatin immunoprecipitation assay 
was performed as described previously’. Cells were lysed at 4°C by bead beating 
seven times for 1 min each with 2-min rests on ice. Chromatin fraction was 
sonicated 20 times for 30s, each with a 1-min rest in between cycles using 
Bioruptor. Ab1220 (Abcam) was used for H3K9me2 chromatin immunoprecipi- 
tation and protein A Dynabeads were used in the washing steps. 
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Inhibition of kynurenine 3-monooxygenase (KMO), an enzyme in 
the eukaryotic tryptophan catabolic pathway (that is, kynurenine 
pathway), leads to amelioration of Huntington’s-disease-relevant 
phenotypes in yeast, fruitfly and mouse models*®, as well as in a 
mouse model of Alzheimer’s disease’. KMO is a flavin adenine 
dinucleotide (FAD)-dependent monooxygenase and is located in 
the outer mitochondrial membrane where it converts L-kynurenine 
to 3-hydroxykynurenine. Perturbations in the levels of kynurenine 
pathway metabolites have been linked to the pathogenesis of a 
spectrum of brain disorders’, as well as cancer”* and several peri- 
pheral inflammatory conditions’. Despite the importance of KMO 
as a target for neurodegenerative disease, the molecular basis of KMO 
inhibition by available lead compounds has remained unknown. 
Here we report the first crystal structure of Saccharomyces cerevisiae 
KMO, in the free form and in complex with the tight-binding inhi- 
bitor UPF 648. UPF 648 binds close to the FAD cofactor and perturbs 
the local active-site structure, preventing productive binding of the 
substrate L-kynurenine. Functional assays and targeted mutagenesis 
reveal that the active-site architecture and UPF 648 binding are essen- 
tially identical in human KMO, validating the yeast KMO-UPF 648 
structure as a template for structure-based drug design. This will 
inform the search for new KMO inhibitors that are able to cross 
the blood-brain barrier in targeted therapies against neurodegene- 
rative diseases such as Huntington’s, Alzheimer’s and Parkinson’s 
diseases. 

There is great interest in the causative role of kynurenine pathway 
metabolites in neurodegenerative disorders such as Huntington’s and 
Alzheimer’s diseases®. Several of these metabolites are neuroactive: 
quinolinic acid (QUIN) is an excitotoxin'®", 3-hydroxykynurenine 
(3-HK) generates free radicals'*, xanthurenic and cinnabarinic acids 
activate metabotropic glutamate receptors’*’* and kynurenic acid 
(KYNA) is a neuroprotectant®. KMO lies at a critical branching point 
in the pathway between the synthesis of 3-HK/QUIN and KYNA 
(Fig. 1a) and its activity has a role in the neurotoxic and neuroprotec- 
tive potential of the pathway. In the brain, KMO is expressed at low 
levels in neurons"* and is predominantly expressed in microglia’"®, the 
resident immune cells of the CNS, suggesting a link between KMO 
function and inflammatory processes in the brain. 

Inhibition of KMO activity leads to amelioration of several disease- 
relevant phenotypes in yeast, fruitfly and mouse models’. Increased 
levels of KYNA relative to neurotoxic metabolites seem critical for this 
protection. Restoring endogenous levels of 3-HK to fruitflies lacking 
KMO activity eliminates this neuroprotection’, highlighting beneficial 
effects of 3-HK reduction due to KMO inhibition. In addition, phar- 
macological inhibition of KMO is neuroprotective in animal models of 
cerebral ischaemia'”"*, reduces dystonia in a genetic model of paroxys- 
mal dyskinesia’, improves levodopa-induced dyskinesia in parkinso- 
nian monkeys” and extends lifespan in a mouse model of cerebral 
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malaria”'. Therefore, inhibition of KMO activity is an attractive thera- 

peutic strategy for several acute and chronic neurological diseases’. 
Despite interest in targeting KMO only a few potent inhibitors are 

available, and none appreciably penetrate the blood-brain barrier in 
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Figure 1 | Inhibition of KMO by UPF 648. a, Schematic overview of L-KYN 
metabolism. The KMO inhibitor UPF 648 is shown in blue. The hydroxyl 
moiety introduced by KMO is shown in red. b, Fractional velocity of 3-HK 
formation as a function of UPF 648 concentration with human and S. cerevisiae 
KMO (blue circles, human KMO; red squares, S. cerevisiae KMO). Error bars 
are standard deviation of three replica points. Inset, HPLC elution curves of 
product (3-HK) and substrate (L-KYN) at varied UPF 648 concentrations. 
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Table 1 | Kinetic and dissociation constants for human, S. cerevisiae 
and active-site variants of KMO 


Enzyme K, app UPF 648 (nM) Ka UPF 648 (uM) 
hKMO* 56.7+8 - 
ScKMO(A394) 74+14 0.14+0.01 
ScKMO(A394, R83A)+ = 3.1402 
ScKMO(A394, R83M)+ - 3.2+0.2 


* Dissociation constant (Ka) for the human (h)KMO-UPF 648 complex could not be determined by 
fluorescence emission owing to low KMO expression yield. 

+ Kinetic assays to calculate the apparent inhibition constant (kK; app) of S. cerevisiae (Sc)KMO variants 
by UPF 648 could not be carried out because enzyme activity was substantially compromised after 
mutagenesis. As an alternative, fluorescence-emission measurements were performed to determine Ky 
by ligand perturbation of flavin fluorescence. 


adult animals*””. One of these, UPF 648, has a half-maximum inhi- 
bitory concentration (IC;9) of 20 nM and provides protection against 
intrastriatal QUIN injections in kynurenine aminotransferase (KATII, 
also known as AADAT)-deficient mice”. UPF 648 treatment also 
shifts kynurenine pathway metabolism towards enhanced neuropro- 
tective KYNA formation*™, and ameliorates disease-relevant pheno- 
types in a fruitfly model of Huntington’s disease*. That known 
inhibitors do not cross the blood-brain barrier is an impediment to 
KMO-targeted drug discovery. KMO structures in complex with tight- 
binding inhibitors are required to design small-molecule inhibitors 
that can penetrate the blood-brain barrier. With this in mind, we 
determined the crystal structure of yeast KMO complexed with 
UPF 648. This enzyme-inhibitor structure can now be used to help 
develop new inhibitors of highly related human KMO. 

We expressed full-length human KMO using the insect cell bacu- 
lovirus system, which yielded small quantities (0.5 mg per litre of cul- 
ture) of detergent-solubilized active KMO. The recombinant form had 
similar kinetic constants to native KMO from pig liver mitochondria”. 
UPF 648 binds tightly to recombinant KMO (inhibition constant (K;) 
= 56.7nM). Poor stability and low expression yields, however, pre- 
vented crystallization. We therefore turned to S. cerevisiae KMO, 
which is related to human KMO (38% identity and 51% similarity). 
Expression of full-length S. cerevisiae KMO yielded a protein fragment 
(KMO(A396)) with a lower molecular weight than anticipated. Elec- 
trospray ionization mass spectrometry indicated proteolytic cleavage 
at residue 396. Subsequently, we isolated a KMO(A394) (residues 394- 
460 deleted) version of the enzyme engineered by site-directed muta- 
genesis (Methods) to define the cleavage point before crystallization 
(Supplementary Fig. 1 and Supplementary Table 1). The KMO(A394) 
enzyme was active (Supplementary Figs 2 and 3), generated authentic 
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3-HK in high-performance liquid chromatography (HPLC)-based 
assays (Fig. 1b) and was inhibited by UPF 648 (K; = 74nM) with a 
potency similar to that seen with human KMO (Fig. 1b and Table 1). 

The structure of the proteolysed form of yeast KMO (KMO(A396)) 
was determined using selenomethionine single anomalous diffraction 
(Protein Data Bank (PDB) codes 4J2W and 4J31). We also solved 
structures of KMO(A394) to 1.85 A resolution. The final model con- 
tains residues 1-97 and 101-390 and the bound FAD cofactor. Both 
crystal forms contain a putative KMO dimer in the asymmetric unit 
(Fig. 2a). The KMO fold is similar to other flavin-dependent hydro- 
xylase structures**”’, with highest structural similarity to 2-methyl-3- 
hydroxypyridine-5-carboxylic acid oxygenase” (root mean squared 
deviation, 2.3 A over 310 Cou atoms; overall sequence identity, 16%; 
Q-score, 0.43; Z-score, 15.0). An overlay of individual KMO mono- 
mers reveals variation in the position of the carboxy-terminal «-helix, 
with most monomers showing disorder beyond residue 380. The linker 
region following the second strand of the antiparallel §-sheet involved 
in substrate binding is disordered, with large variations in the positions 
of residues 96-97 and 101-104. The relative position of the FAD- 
binding domain and six-stranded antiparallel §-sheet domain is sub- 
ject to minor variation, reminiscent of domain motion coupled to 
substrate binding in other members of this family”®. 

In absence of substrate, flexibility in relative positioning of both 
domains flanking the KMO active site is also reflected in distinct con- 
formations observed for residues lining the active site (Fig. 3). The re- 
face of the FAD is connected to solvent by a narrow water-filled cavity 
that runs perpendicular to the active-site cleft (Supplementary Fig. 4). 
A structural water is located above the FAD C4a, mimicking the posi- 
tion of the C4a—peroxide intermediate formed upon reaction with 
oxygen. The dimethylbenzene moiety of the FAD isoalloxazine is pro- 
tected from solvent by Lys48 and the conserved residue Tyr 195 
(Supplementary Fig. 4). In absence of large protein rearrangements, 
this suggests that a ‘waving flavin’ motion as demonstrated in other 
FAD-dependent monooxygenases” is unlikely to occur in KMO dur- 
ing turnover. 

We were unable to obtain a KMO complex with L-kynurenine (L- 
KYN) but succeeded in co-crystallizing with UPF 648 (Supplementary 
Table 3, PDB code 4J36). The asymmetric unit contains a putative KMO 
dimer with one monomer containing UPF 648 bound in the active site, 
adjacent to the FAD re-face (Figs 2b and 3). The UPF 648 carboxylate is 
bound by conserved residues Arg 83 and Tyr 97 whereas the aromatic 
dichlorobenzene moiety is flanked by several hydrophobic residues 


Figure 2 | Yeast KMO crystal structure. a, Depiction of the S. cerevisiae KMO 
dimer. One monomer is represented in cartoon format coloured according to 
secondary structure and the second monomer is displayed in ribbon format, 
with an overlay of the five ligand-free KMO monomers. For the second 
monomer, the C-terminal region (residue 348 onwards) is coloured red and the 
flexible linker region (residues 96-104) is depicted in green. b, Depiction of 


changes induced by binding of the KMO inhibitor UPF 648. An overlay of both 
monomers from the KMO-UPF 648 structure is presented (PDB code 4J36), 
the unbound monomer coloured in grey. The UPF-bound monomer is 
coloured as in a. UPF 648 and FAD are shown as spheres colour-coded 
according to atom type. 
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E102 


Free KMO 


Figure 3 | The Saccharomyces cerevisiae KMO active site. Left panel shows 
an overlay of free enzyme structures obtained from various crystal forms, right 
panel depicts the KMO-UPF 648 interaction and middle panel shows a model 


(Leu 221, Met 230, Ile 232, Leu 234, Phe 246, Pro 321, Phe 322), which 
are conserved in many KMO enzymes. UPF 648 binding induced struc- 
tural changes in the enzyme, notably reorientation of the Pro 321- 
Gln 325 loop flanking the re-side of the FAD. A minor reorientation 
in the position of the six-stranded antiparallel B-sheet domain with 
respect to the flavin-binding domain is also evident. These changes 
result in increased disorder of the C-terminal «-helix, which is only 
visible up to Arg 359 (Fig. 2b). Reorientation of the Pro 321-Gln 325 
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of the KMO-L-KYN complex. Electron density is shown for UPF 648 (2F, - F. 
contoured at 1a). 


loop is a consequence of the active site adapting to the presence of 
vicinal chloride substituents in UPF 648, neither of which have a 
counterpart in L-KYN. To provide sufficient space for both chlorides, 
Phe 322 moves away from the active site, effectively occupying a posi- 
tion previously taken by Tyr 323. The Pro 321—Gln 325 loop reorients to 
compensate for the altered Phe 322 position. This loop lines the postu- 
lated oxygen-binding site above the re-side of the FAD, which is effec- 
tively destroyed on binding UPF 648. Binding UPF 648 was found to 
accelerate hydrogen peroxide formation by a factor of ~20-fold com- 
pared to reactions in absence of UPF 648 (Methods). This indicates a 
destabilization of the flavin C4a—hydroperoxide intermediate formed in 
the natural catalytic cycle of flavin monooxygenases in the presence of 
UPF 648 (Supplementary Table 2). 

The chemical similarity of UPF 648 and L-KYN allowed modelling 
of L-KYN in the KMO active site (Methods). Modelling suggests that 
L-KYN is bound similarly, but without effect on the Pro 321-Gln 325 
loop. The aromatic substrate moiety is located in the conserved hydro- 
phobic pocket (residues Leu 221, Met 230, Ile 232, Leu234, Phe 246, 
Pro 321, Phe 322) on the re-face of the flavin (Fig. 3). Additional polar 
contacts are formed between the conserved Gln 325 and the L-KYN 
carbonyl group, and between the substrate aniline nitrogen atom and 
the FAD O4 atom. Whereas the amino acid carboxylate is bound by 
Arg 83 and Tyr 97, the amino group is devoid of direct interactions 
with protein in the model. An additional salt bridge may be made 
between the L-KYN amine and the side chain of Glu 102, which is 
located in a highly flexible region of KMO. However, this residue is 
often replaced by a glutamine in other KMO enzymes (Supplementary 


Figure 4 | Mechanism and importance of Arg 83. a, The proposed KMO 
L-KYN-hydroxylation mechanism based on the S. cerevisiae KMO-L-KYN 
model. After hydroxylation, the intermediate rearranges to form the 3-HK 
product. b, Enzymatic activity comparison between wild-type KMO and Arg 83 
mutants. Enzyme activity is significantly reduced following mutation (<29% 
and <1% of wild-type activity for Ala 83 and Met 83 mutant KMO enzymes, 
respectively). Inset, measurement of binding constant for KMO inhibitor 
UPF 648 by ligand perturbation of flavin fluorescence emission. Titration of 
UPF 648 to 5 11M wild-type KMO (closed squares), R83M (closed triangles) and 
R83A (closed circles) mutants resulted in perturbation of 520 nm fluorescence 
emission. Fluorescence changes at 520 nm as function of UPF 648 
concentration fitted to the Morrison equation yields an observed 


WT R83A 


R83M 
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Ky = 137.8 + 8 nM for wild type, Ky = 3.1 + 0.2 1M for R83M and 
Kg = 3.2 + 0.2 UM for R83A enzymes. Error bars represent the standard 
deviation of three replica points. AU, arbitrary units. 
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Fig. 5), suggesting that this interaction is not critical for enzyme acti- 
vity. The model places the substrate C3 atom adjacent to the flavin C4a, 
where it is poised to attack the flavin C4a—peroxide intermediate 
(Fig. 4a). 

All residues implicated by the KMO-L-KYN model as being 
involved in L-KyN binding are conserved across KMOs (Supplemen- 
tary Fig. 5). We validated this model by mutating residue Arg 83 
(replaced by Ala 83 and Met 83) and performing inhibitor binding/ 
kinetic assays (Fig. 4b). Enzyme activity is compromised following 
mutation (25% and <3% of wild-type activity for Ala83 and Met 83 
mutant KMO enzymes, respectively) as predicted by the KMO-L-KYN 
model. Mutation led to ~20-fold increase in dissociation constant (Kg) 
for the KMO-UPF 648 inhibitor complex, which implicates Arg 83 in 
inhibitor binding. 

Elucidation of the KMO crystal structure in free form and in com- 
plex with an established inhibitor is a major breakthrough for new 
KMO inhibitor design. This will permit docking screens using virtual 
compound libraries that may ultimately identify novel inhibitor scaf- 
folds. Such studies will enable the design of new inhibitors that possess 
the selectivity and affinity to open up new opportunities for thera- 
peutic intervention. Critically, this should inform the development 
of brain-penetrant KMO inhibitors. The KMO inhibitor Ro 61-8048 
and prodrug JM6 have shown preclinical promise in animal models of 
neurodegeneration* *'**°, but they do not appreciably penetrate the 
blood-brain barrier. These compounds probably confer neuroprotec- 
tion by inhibiting KMO in the blood and consequent active transport 
of L-KYN into the brain, ultimately yielding increased levels of the 
neuroprotective metabolite KYNA**”. As this neuroprotection seems 
to be independent of KMO inhibition in the brain and reduction of 
downstream neurotoxic metabolite levels’, delivery of KMO inhibitors 
directly to the CNS should provide critical additional therapeutic effi- 
cacy and broaden the scope of disorders amenable to targeting. 


METHODS SUMMARY 


Discrimination assays. Human and S. cerevisiae KMO were produced using 
baculovirus and Escherichia coli expression systems, respectively. HPLC was used 
to monitor inhibition of KMO by UPF 648 by monitoring formation of product 
3-HK across a range of inhibitor UPF 648 concentrations (0-2 1M). Flavin fluor- 
escence-emission measurements were used to determine Kg for KMO ligands 
through ligand perturbation of the fluorescence of enzyme-bound FAD. 
Detailed protocols can be found in Methods. 

Crystallization and diffraction data collection. Crystals of S. cerevisiae KMO 
were obtained using the sitting-drop vapour-diffusion technique. Details describ- 
ing the crystallogenesis can be found in Methods. Diffraction data from S. cerevi- 
siae KMO crystals were collected from single cryo-frozen samples at the Diamond 
Light Source, Harwell, UK. A single-wavelength anomalous dispersion data set at 
the selenium edge was used for initial phase determination. Data collection and 
refinement statistics are presented in Supplementary Table 3. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Cloning, expression and purification of human full-length KMO. The gene 
encoding full-length human KMO was synthesized (GeneScript) and codon opti- 
mized for overexpression in mammalian cells. The gene was subcloned into the 
baculovirus transfer vector pAcGHLT-A-glutathione S-transferase as a Ndel- 
EcoRI fragment and transfected into Hi5 cells along with linearized baculovirus 
using the following sense and antisense primers: 5’-GGCATATGCATGGACAG 
CAGCGTGATCCAGCGGAAG-3’;_ 5’-CCCGAATTCCTACCGGCTGATCAG 
GTTGCTG-3’. Hi5 cells (1.5 X 10°) were infected with recombinant virus for 
72 hat 28 °C. A WAVE Bioreactor System (GE Healthcare Life Sciences) was used 
to grow batches of 5-litres of cell culture. Cultured cells were lysed in 20 mM 
potassium phosphate buffer, pH 7.5, 10% glycerol, 0.5% n-dodecyl B-p-maltoside 
(DDM), 150 mM NaCl, 7 mM 2-mercaptoethanol and 50 1M FAD supplemented 
with protease inhibitors (Sigma-Aldrich). Soluble lysate was incubated with 3 ml 
pre-equilibrated glutathione uniflow resin (Clontech). The resin was then packed 
and washed with buffer B (20mM potassium phosphate buffer, pH 7.5, 10% 
glycerol, 0.012% DDM, 150mM NaCl, 7 mM 2-mercaptoethanol, 50 1M FAD). 
Fractions (0.5 ml) were eluted with buffer C (buffer B + 33 mM glutathione) and 
fractions containing KMO pooled, concentrated and loaded onto a Superdex 200 
(10/30) size-exclusion chromatography column. Pure KMO was pooled and 
stored at —80°C. 

Cloning, expression and purification of S. cerevisiae KMO. A synthetic gene 
(BNA4, UniProtKB accession number P38169) encoding S. cerevisiae KMO was 
codon optimized for overexpression in E. coli (GeneScript). The gene was subcloned 
into pET15b and pET24b (Merck) as a NdeI-XbaI fragment for overexpression in 
E. coli strain BL21(DE3). A deletion variant of KMO (KMO(A-394)) and single- 
amino-acid variants of this deletion (KMO(A394, R83A) and KMO(A394, R83M)) 
were generated by site-directed mutagenesis (QuikChange, Strategene, Agilent 
Technologies). Sense and antisense primers are shown in Supplementary Table 1. 
Amplification conditions were as follows: denaturation at 95 °C for 30s, followed 
by 16 cycles of amplification at 95 °C (30s), 55 °C (60s) and 68 °C (7 min). PCR 
products were incubated for 1 h to digest methylated template DNA and then trans- 
formed into XL1-Blue supercompetent cells (Stratagene, Agilent Technologies). 
Transformed bacterial colonies were isolated and mutated genes were identified 
by DNA sequencing (Eurofins MWG Operon) to verify presence of desired, and 
lack of spurious changes, to the DNA sequence. 

Transformed bacterial cells were grown overnight at 37°C in lysogeny broth 
(LB; 0.751) containing 100 pg ml” * ampicillin. Cultures were grown at 37 °C to an 
attenuance of ~0.3, and the temperature was then reduced to 27 °C. Cells were 
induced (0.1 mM isopropyl-B-p-thiogalactopyranoside; IPTG) and grown (16- 
20h) at 27 °C. Cells were lysed in buffer A (20 mM potassium phosphate buffer, 
pH7.5, 10% glycerol, 300 mM NaCl, 50 4M FAD) containing protease inhibitors. 
Soluble lysate was loaded onto a 5-ml HisTrap FF Ni Sepharose 6 Fast Flow 
column (GE Healthcare) pre-equilibrated with buffer A. Protein was eluted using 
a 50-250-mM imidazole linear gradient. Yellow fractions containing partially 
purified KMO were pooled and diluted ten times in buffer B to reduce salt and 
imidazole concentration. Protein was subsequently loaded onto a 10-ml blue 
Sepharose 6 fast flow column (GE Healthcare) equilibrated with buffer B. The 
protein was eluted using a linear gradient to 2.5M NaCl prepared in buffer B. 
Fractions containing KMO were pooled and diluted tenfold in buffer B to reduce 
the NaCl concentration. Protein was concentrated and chromatographed using a 
Superdex 200 10/300 GL column (GE Healthcare) equilibrated with buffer C (25 mM 
ammonium acetate buffer, pH 7.0, 150 mM NaCl, 7 mM 2-mercaptoethanol). Pure 
fractions were collected and stored at —80 °C. 

Incorporation of selenomethionine into S. cerevisiae KMO. Expression of 
selenomethionine-labelled protein was achieved by inhibiting methionine biosyn- 
thesis shortly before induction of KMO expression by adding high concentrations 
of isoleucine, leucine, phenylalanine, lysine and threonine to the cell culture. 
Transformed E. coli cells containing the S. cerevisiae KMO expression plasmid 
were grown in LB media at 37 °C to late exponential phase, collected and then re- 
suspended in M9 minimal media and grown until mid-log phase. At that point 
lysine, phenylalanine, threonine (100 mgl each) and selenomethionine, isoleu- 
cine, leucine and valine (50 mg 1 each) were added to the culture and induced 
15 min after addition of amino acids with 0.1 mM IPTG. The culture was grown 
for 12-16 h. Purification of selenomethionine-incorporated S. cerevisiae KMO was 
as described for conventional S. cerevisiae KMO. 

Enzyme assays. Steady-state kinetic parameters for KMO were obtained by initial 
rate measurements of enzyme activity. KMO catalyses the NADPH-dependent 
hydroxylation of L-KYN to 3-HK and the enzymatic reaction can be monitored by 
following the decrease in absorbance of NADPH at 340 nm. Rate assays were carried 
out with 1-cm path-length quartz microcuvette at 37 °C for human KMO and 30 °C 
for S. cerevisiae KMO. Pure enzyme (0.2-1.0 [.M) was added to 200 pl reaction buffer 
(20 mM potassium phosphate buffer, pH 8.0, 7 mM 2-mercaptoethanol) containing 


different concentrations of NADPH and L-KYN (Sigma-Aldrich) and the time- 
dependent absorbance change at 340nm was recorded using a using a UV-Vis 
Cary Eclipse spectrophotometer (Agilent Technologies). Assays at each reaction 
condition were performed in triplicate. Apparent Michaelis constants (K,,) for 
L-KYN and NADPH were determined by varying the concentration of the first 
substrate at a constant concentration of the second substrate and vice versa. 
Reaction data were fitted to the standard Michaelis-Menten equation using Origin 
Software (OriginLab). 

KMO inhibition by UPF 648 was measured by monitoring the formation of the 
product 3-HK by Cys reversed-phase HPLC. Enzyme (100 nM) was incubated at 
37 °C in 1 ml reaction buffer (20 mM potassium phosphate buffer, pH 8.5, 7 mM 
2-mercaptoethanol) with 2mM NADPH, 500 1M L-KYN and different concen- 
trations (0-2 |1M) of UPF 468. The reaction was stopped with 8% trifluoroacetic 
acid added at different time points during the assay. Precipitated protein was 
removed by centrifugation, and the supernatant was analysed by C,g reversed- 
phase HPLC equilibrated with 50 mM ammonium acetate, pH 3.0, containing 1% 
methanol and 0.1% heptanesulphonic acid; elution from the column was with 
50mM ammonium acetate, pH 4.5, containing 5% methanol and 0.5% heptane- 
sulphonic acid, and monitored at 229 nm. 3-HK was quantified by comparison 
with a 3-HK calibration curve. Each experiment was performed in triplicate. Data 
were fitted to the Morrison equation: 


Vo ((E] + [1] +Ki) Ve +] +Ki)* —4[E) [0] 
Vi 2|E] 


(1) 


in which vo is the enzyme activity without inhibitor, v; is the enzyme activity at 
ligand concentration I, [E] is the enzyme concentration, [I] is the inhibitor con- 
centration and K; is the inhibition constant. 

Ligand-binding assays. Fluorescence-emission measurements were performed 
using a Cary Eclipse Fluorimeter (Agilent Technologies) to determine enzyme- 
ligand Kg for KMO ligands, exploiting ligand perturbation of flavin (enzyme- 
bound FAD) fluorescence. Excitation light was provided from a xenon light source 
and excitation and emission slit widths were 5nm. The perturbation in FAD 
fluorescence emission was followed at 520 nm using an excitation wavelength of 
450 nm and excitation spectra were recorded from 470 to 700nm. L-KYN and 
UPF 648 were titrated into a 5-11M solution of KMO (20 mM potassium phosphate 
buffer, pH 8, 50 mM NaCl, 7 mM 2-mercaptoethanol) at 25 °C. Kg was calculated 
by fitting data for the difference in fluorescence emission measured for KMO- 
ligand complexes and free KMO at 520 nm divided by the maximum emission 
versus the ligand concentration (equation (2)). This equation is a variation of 
Morrison equation used in equation (1) and is used for tight-binding ligands or 
when Ky values are similar in magnitude to the concentration of enzyme used, in 
which case substantial amounts of both enzyme and ligand are consumed in 
forming the EX complex as the titration progresses. All data fitting was done using 
Origin software (OriginLab). 


Eobs ([E|+X-+Ka)—1/ (E]+X+Ka)?—4[B)X 
Emax 2|E] 


(2) 


In equation (2), Eops represents the fluorescence-emission difference at each ligand 
concentration, X; Emax is the maximum emission change at saturating ligand 
concentrations, [E] is the enzyme concentration and Kg the dissociation constant 
for the ligand bound to the enzyme. 

Hydrogen peroxide quantification. Measurements to observe the formation of 
hydrogen peroxide from the AKMO-394-UPF 648 inhibitor complex were per- 
formed using a horseradish peroxidase (HRP) assay. Steady-state assays were 
performed as described above to calculate the NADPH consumption rate in the 
presence and absence of an excess of UPF 648 (30 uM). KMO (5 tM) was incu- 
bated with 150 uM NADPH, 150 uM L-KYN, 400 1M o-dianisidine and ~5 units 
HRP in the presence and absence of excess UPF 648. The rate of oxidation of 
o-dianisidine by H,O,, catalysed by HRP, was monitored at 440nm 
(Aé 4a0nm = 11.600M_'cm7!). The rate of hydrogen peroxide production was 
compared to the rate of NADPH consumption. 

Crystallization of S. cerevisiae KMO. Initial crystals of S. cerevisiae KMO 
(KMO(A396) and KMO(A396)-Se) were obtained by mixing 200nl of 14mg 
ml ' protein in 20mM ammonium acetate, pH 7.0, 150mM NaCl and 7mM 
2-mercaptoethanol (buffer A) with 200nl of a reservoir solution containing 
0.1M sodium acetate, pH5.5, and 35% isopropanol. A second, more readily 
reproducible, crystal form was obtained with AKMO-394 and AKMO—394- 
HIS. In both cases crystals were grown by mixing 200 nl of protein (buffer A) 
with 200 nl of a reservoir solution containing 0.1 M imidazole, pH 7.8, and 11% 
w/v polyethylene glycol at 277 K. The UPF complex (AKMO-394-UPF 648) was 
also obtained as described above; however, before setting the trays the protein was 
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pre-incubated with 1 mM UPF 648 for ~30 min. All trays were incubated at 277 K, 
with crystals forming over a period of ~72h. 

Diffraction data processing, structure determination and refinement. X-ray 
diffraction data were collected at Diamond Light Source and subsequently inte- 
grated and scaled using the program XDS"". Initial phases were obtained from a 
single-wavelength anomalous dispersion data set (S1) collected at the selenium 
edge. Selenium sites were located using PHENIX AutoSol” yielding an electron 
density map that could be auto traced using PHENIX AutoBuild’’. The resulting 
model was completed through iterative rounds of rebuilding in Coot* and refine- 
ment in PHENIX”. All subsequent structures were solved by molecular replace- 
ment in Phaser*® using this initial selenomethionine-derived structure as the 
template. Structure validation with MolProbity” was integrated as part of the 
iterative rebuild and refinement procedure. 

KMO-1-KYN modelling. NAMD software** was used to perform all molecular 
dynamics simulations of the KMO-1L-KYN complex. Topology and parameters 
files for substrate and FAD were obtained using Antechamber program” using 
AM1-BCC charges”. The complex model was then immersed in a periodic water 
box (TIP3) and neutralized by adding Na” ions. Several cycles of minimizations 
(steepest descent, 10,000 steps) and molecular dynamics simulations (50 K, 20 ps) 
were performed to equilibrate the model (backbone protein atoms were kept 
fixed). Then molecular dynamics simulations were performed (310K, 1 ns) at a 
time step of 2 ps, with the protein backbone restrained on the X-ray structure 
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conformation. Individual snapshots were extracted and minimized to assess the 
KMO-L-KYN interactions. 


31. Kabsch, W. XDS. Acta Crystallogr. D 66, 125-132 (2010). 

32. McCoy, A.J., Storoni, L.C. & Read, R. J. Simple algorithm fora maximum-likelihood 
SAD function. Acta Crystallogr. D 60, 1220-1228 (2004). 

33. Terwilliger, T. C. et al. Iterative model building, structure refinement and density 
modification with the PHENIX AutoBuild wizard. Acta Crystallogr. D 64, 61-69 
(2008). 

34. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta 
Crystallogr. D 60, 2126-2132 (2004). 

35. Adams, P. D. et a/. PHENIX: a comprehensive Python-based system for 
macromolecular structure solution. Acta Crystallogr. D 66, 213-221 (2010). 

36. McCoy, A.J. etal. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658-674 

(2007). 

hen, V. B. et al. MolProbity: all-atom structure validation for macromolecular 

rystallography. Acta Crystallogr. D 66, 12-21 (2010). 

hillips, J.C. et a/, Scalable molecular dynamics with NAMD. J. Comput. Chem. 26, 

781-1802 (2005). 

‘ornell, W. D. et al. A second generation force field for the simulation of 

roteins, nucleic acids, and organic molecules. J. Am. Chem. Soc. 117, 5179-5197 

995). 

AO. Jakalian, A.,Jack, D. B. & Bayly, C. |. Fast, efficient generation of high-quality atomic 

charges. AM1-BCC model: Il. Parameterization and validation. J. Comput. Chem. 

23, 1623-1641 (2002). 


37. 


ioe) 


38. 


39. 


SO Ove 


©2013 Macmillan Publishers Limited. All rights reserved 


CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/naturel2075 


Corrigendum: Serine is a natural 
ligand and allosteric activator of 
pyruvate kinase M2 


Barbara Chaneton, Petra Hillmann, Liang Zheng, 
Agnes C. L. Martin, Oliver D. K. Maddocks, 
Achuthanunni Chokkathukalam, Joseph E. Coyle, 
Andris Jankevics, Finn P. Holding, Karen H. Vousden, 
Christian Frezza, Marc O’Reilly & Eyal Gottlieb 


Nature 491, 458-462 (2012); doi:10.1038/nature11540 


In Fig. la of this Letter, the western blot originally published in the 
PKM2 panel was a re-probe of the blot used in the PKM1 panel and the 
signal picked up was from the first antibody (PKM1) and not from 
PKM2. We have repeated the experiment on three different membranes 
with the antibodies used in the original paper for PKM1, PKM2 and 
total PKM, using actin as the loading control (Fig. 1 of this Corrigendum 
shows the corrected Fig. 1a). We apologize for this inadvertent error. 
None of the conclusions of the original Letter are altered. 
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CORRIGENDUM 
doi:10.1038/nature12078 


Corrigendum: Mitochondrial Atpifl 
regulates haem synthesis in 
developing erythroblasts 


Dhvanit I. Shah, Naoko Takahashi-Makise, Jeffrey D. Cooney, 
Liangtao Li, Iman J. Schultz, Eric L. Pierce, Anupama Narla, 
Alexandra Seguin, Shilpa M. Hattangadi, Amy E. Medlock, 
Nathaniel B. Langer, Tamara A. Dailey, Slater N. Hurst, 

Danilo Faccenda, Jessica M. Wiwczar, Spencer K. Heggers, 
Guillaume Vogin, Wen Chen, Caiyong Chen, Dean R. Campagna, 
Carlo Brugnara, Yi Zhou, Benjamin L. Ebert, Nika N. Danial, 
Mark D. Fleming, Diane M. Ward, Michelangelo Campanella, 
Harry A. Dailey, Jerry Kaplan & Barry H. Paw 


Nature 491, 608-612 (2012); doi:10.1038/nature11536 


On page 610 of this Letter, the overview comments concerning the 
action of ATP synthase should read: “Under physiological conditions, 
the F,Fo-ATP synthase generates ATP by consuming the proton 
motive force existing across the inner mitochondrial membrane, trans- 
porting H™ ions from the intermembrane space into the mitochondrial 
matrix®*’*, When respiration is inhibited or the mitochondrial mem- 
brane potential (A‘¥m) is decreased, the F,Fy-ATP synthase reverses 
the direction of H* transport, dissipating ATP in the attempt to main- 
tain the proton motive force*’’. Under these conditions, Atpifl inhibits 
the reversal of the F,Fy-ATP synthase to decrease ATP hydrolysis, 
thereby counteracting the maintenance of the mitochondrial mem- 
brane potential (A‘Ym) and proton motive force*’. This rewording 
has no effect on the data, the scientific interpretation and the resulting 
model presented, and has been corrected in the HTML and PDF versions 
of the original paper. 
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REGULATORY SCIENCE 


Researchers in the pipeline 


Prospects are starting to look bright in the growing field of science intended to aid 
regulation of food safety and drug development. 


BY AMBER DANCE 


hen the Deepwater Horizon rig 
exploded in April 2010, releasing 
nearly 5 million barrels of oil into 


the Gulf of Mexico, fisheries were closed in 
more than one-third of US Gulf waters. Scien- 
tists had to test hundreds of samples of seafood 
for hydrocarbon contamination, but existing 
chemical tests took a week to yield results. 
Researchers at the US Food and Drug 
Administration (FDA) and other agencies 
joined forces to develop a speedier assay that 
would let them rate seafood as safe or unsafe 
within two days. By refining methods that 
already existed, the agencies were able to geta 


new test up and running by July 2010. Fisher- 
ies reopened and seafood was back on menus 
weeks earlier than would have been possible 
with the original test. Such quick, collaborative 
problem-solving exemplifies the benefits of 
regulatory science done not only in government 
agencies, but also in industry and academia. 
Regulatory science is “not a new thing, it’s 
just a new term’, says Erin Wilhelm, project 
director at the Center for Excellence in Regula- 
tory Science and Innovation (CERSI) in Wash- 
ington DC, launched in 2011 by the FDA and 
Georgetown University. “It’s really an umbrella 
term for all different kinds of science that 
impact drug development or device develop- 
ment or food science or tobacco regulation.” 
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People wanting to work in regulatory science 
mostly need expertise in a relevant branch of 
science, but focused regulatory training is also 
available (see “Learning the ropes’). Jobs may 
involve reviewing applications for product 
approval, as well as a variety of research topics, 
such as identifying biomarkers for drug activity, 
gauging risks to public health during the drug- 
approval process or tracking a drug’s effects 
after it has reached the market. Companies and 
industry groups that need to gain approval for 
their products are seeking team members who 
understand the regulatory world. Jobs are avail- 
able in many countries, especially in the United 
States, where the field has garnered interest in 
government and other sectors. > 
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> Most people distinguish between regu- 
latory science and regulatory affairs, which 
focuses on making sure that regulations are 
followed. “We're talking real science,” says 
Hans-Georg Eichler, senior medical officer 
at the European Medicines Agency (EMA) 
in London. People who do regulatory science 
have diverse backgrounds, ranging from bio- 
chemistry to statistics. And they tend to think 
of themselves as statisticians or biochemists, 
not ‘regulatory scientists, he says. 

“It’s not been an area of high profile,” notes 
John Burris, president of the Burroughs Well- 
come Fund in Research Triangle Park, North 
Carolina, which funds biomedical research. 
“People aren't winning Nobel prizes in it.” 
Yet the results of regulatory science have the 
potential to make enormous impact, he says. 
For example, a job in basic research identify- 
ing biomarkers for disease could help scien- 
tists running clinical trials to identify who is 
responding well to treatment. And by testing 
drugs in model systems, researchers may be 
able to learn about the efficacy and side effects 
of a compound before taking on the cost and 
risk of human trials. 

Model systems currently under develop- 
ment by regulatory scientists include the 
‘human on a chip, which simulates ten dif- 
ferent organs for drug-safety testing. Another 
is the virtual family — anatomically correct 
digital models of male and female adults and 
children, based on body scans of volunteers 
and used in computer simulations. The vir- 
tual family is already being applied to predict 
body-temperature changes in the tissues sur- 
rounding implanted devices when people are 
exposed to radio waves or microwaves. 


OPPORTUNITIES ARISING 

Governments and institutions have started to 
take note of the field. The Burroughs Wellcome 
Fund is evaluating applications for its first Inno- 
vation in Regulatory Science Awards — grants 
for academics addressing regulatory topics 
— which it initiated to kick-start investment 
in the area. It has received applications from 
about 60 faculty members in North America, 
and it plans to endow at least five projects with 
up to US$500,000 each over five years. If the 
programme is successful, Burris hopes to make 
it an annual call for proposals. 

The FDA does not offer many research 
grants, although it partnered with the US 
National Institutes of Health in 2010 to offer 
a total of US$6.75 million for a few three-year 
projects in regulatory areas such as disease bio- 
markers. Last year, it also issued a call for exter- 
nal contractors to conduct regulatory research. 

In Japan, the Pharmaceuticals and Medical 
Devices Agency established an Office of Regu- 
latory Science in 2009. The Ministry of Health, 
Labour and Welfare offered ¥1.2 billion (US$12 
million) in grants in the regulatory field in 2012. 

In Europe, the Innovative Medicines Initia- 
tive (IMI) in Brussels supports public-private 
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collaborations on topics such as biomarkers 
and disease models in the hope of improving 
the drug-development process. 

The blooming interest in regulatory science 
has not necessarily translated into job oppor- 
tunities at the FDA or EMA, however. Eichler 
says that the EMA’s job market was growing 
rapidly until 2009, when the economy began 
its downward spiral and hiring stalled. IMI 
executive director Michel Goldman, by con- 
trast, sees opportunities worldwide, includ- 
ing in Europe, Japan and China. And a 2012 
review by the non-profit Partnership for Pub- 
lic Service in Washington DC found that the 
FDA hired 2,221 people in 2010, up from 817 
in 2007. However, many positions are tempo- 
rary. And although FDA chief scientist Jesse 
Goodman anticipates hiring more people as 
the older generation of employees retires, this 
year’s federal budget cuts have the potential 
to hit the agency hard. The FDA spends the 
majority of its money on staff and related costs. 


Although no forced unpaid leave or lay-offs are 
expected, the budget cuts could lead to delayed 
hiring and reduced travel and overtime. 


PETRI DISHES AND PAPERWORK 
A regulatory agency is the obvious — although 
not the only — place to start a career in regu- 
latory science. Some FDA employees have 
left experimentation behind, whereas others 
split their time between research projects and 
reviewing applications for product approvals. 
In some ways, working in regulation is like 
being a principal investigator, says Candice 
Jongsma, a regulatory-science fellow at the 
FDA Center for Tobacco Products in Rock- 
ville, Maryland. She may not do the experiments 
herself, but she reviews data, asks questions and 
makes recommendations. One of her projects 
is to compare the lists of ingredients in new 
and existing tobacco products and determine 
whether the new recipe raises health questions 
that will need to be addressed before approval. 


TRAINING PROGRAMMES 


Learning the ropes 


Training in regulatory science has tended 
to be an on-the-job apprenticeship, with 
the major requirement a solid background 
in science, but targeted education is also 
available. 

Fellowships can help researchers to 
explore the regulatory side of science before 
applying for a permanent position. Candice 
Jongsma is among the first Tobacco 
Regulatory Science Fellows at the US Food 
and Drug Administration (FDA) Center for 
Tobacco Products in Rockville, Maryland; 
her one-year programme is co-sponsored 
by the US Institute of Medicine. Jongsma 
applies her chemistry knowledge to look 
for safety concerns in the ingredients of 
new products, and her fellowship includes 
training courses and personal meetings with 
the secretary of Health and Human Services 
and the commissioner of the FDA. The FDA 
also sponsors a two-year Commissioner’s 
Fellowship, which may open the door to 
further work at the agency after they have 
completed the programme. 

Meanwhile, educational programmes 
focused on regulation are on the rise. 

They are often geared towards working 
professionals, and involve weekend or 
evening classes and online learning. Some 
curricula concentrate on science and drug 
development; others focus more on laws 
and regulations, or pricing and business 
decision-making, says Frances Richmond, 
director of the International Center for 
Regulatory Science at the University of 
Southern California in Los Angeles, which 
offers master’s and doctoral degrees. 
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Master’s programmes are in 
development, and are expected to open 
this year, at Georgetown University in 
Washington DC; the University of Maryland 
in Baltimore, as an online course; and the 
University of Pennsylvania in Philadelphia. 
In the Maryland programme, for example, 
students will get a grounding in regulatory 
affairs, drug discovery and development, 
clinical research and surveillance to track 
the safety of a drug after it has been 
approved. 

In Europe, students can enrol in short 
courses lasting a few days, or year-long 
master’s programmes. Many focus on 
regulatory affairs and some on regulatory 
science. The Innovative Medicines Initiative 
(IMI) in Brussels runs the postgraduate 
Pharmatrain programme, which 
includes short classes and master’s- 
level training in drug development. Its 
classes are available in several locations 
across the continent. The IMI’s European 
programme in Pharmacovigilance and 
Pharmacoepidemiology, called Eu2P, offers 
online training at the certificate, master’s 
and PhD level. 

The Japanese Pharmaceuticals and 
Medical Devices Agency has united with a 
dozen universities to allow its employees to 
earn graduate degrees and to let university 
graduate students conduct their thesis 
research at the agency. 

“Talent is hard to find,” says Richmond. 
“A person with good science skills and a 
degree in regulatory science generally has 
no problems getting a good job.” A.D. 


Ernest Kwegyir-Afful was a neuroscience 
postdoc when he applied for a two-year fel- 
lowship at the FDA in the hope of getting 
to conduct research that would have an 
immediate impact. At the FDA’ Center for 
Food Safety and Applied Nutrition in Col- 
lege Park, Maryland, Kwegyir-Afful worked 
on estimating the risk that small quantities 
of food allergens might endanger sensi- 

tive consumers. 


“It’s afield He came up with 
thatyouwould a computational 
enjoy if you method for risk 
don’t want analysis that was 
to think one- closely tailored to 


allergens. After fin- 
ishing his fellowship 
last September, Kwegyir-Afful accepted 
a permanent position at the centre, doing 
similar work. Regulatory science is “a field 
that you would enjoy if you don’t want to 
think one-dimensionally,’ he says, noting 
that he works with experts in topics such 
as toxicology or law. Communication skills 
and a team-oriented approach are essen- 
tial, says Frances Richmond, director of the 
International Center for Regulatory Science 
at the University of Southern California in 
Los Angeles. 

Regulatory-strategy units at pharmaceu- 
tical companies are good places to explore 
the regulatory side of industry, says Robert 
Meyer, who directed the FDA Office of 
Drug Evaluation before becoming vice- 
president of global regulatory strategy for 
Merck at its campus in Upper Gwynedd, 
Pennsylvania. Company regulatory sci- 
entists would do well to understand the 
relevant laws for FDA reviewers, which 
can help company scientists to do the right 
experiments and to offer the FDA the data it 
needs to make decisions, says Meyer. That is 
especially true for people working at small 
companies, because they participate in 
many aspects of product development, says 
James Polli, co-principal investigator of an 
FDA CERSI at the University of Maryland 
in Baltimore. 

Academic scientists, too, can address reg- 
ulatory questions or advise the FDA. Subha 
Madhavan, a bioinformatician at George- 
town University, is building databases that 
will be used by FDA scientists looking to 
understand why people respond to drugs 
in different ways, and how vaccines might 
be linked to autoimmune disease. She regu- 
larly meets FDA staff to explore their needs, 
and expects to publish the work. 

“Regulatory science is the field of the 
future, if you will, in terms of drug develop- 
ment and device development,” says 
Wilhelm. “At some point, it will mean some- 


2 


thing to say, ‘Iam a regulatory scientist?” = 


dimensionally.” 


Amber Dance is a freelance writer in 
Los Angeles, California. 


TURNING POINT 


CAREERS 


Adam de la Zerda 


Adam de la Zerda was on track to become 

an electrical engineer when a personal loss 
prompted him to switch to cancer biology. Now 
a structural biologist at Stanford University 

in California, de la Zerda has developed a 
technique for imaging tumour cells. 


Why did you take up cancer biology? 

In my undergraduate programme at the 
Technion Israel Institute of Technology in Haifa, 
I focused on computer science, engineering and 
physics. When I started my PhD at Stanford, I 
was working on experimental quantum phys- 
ics. But during winter break in my first year, a 
good friend passed away from cancer. That was 
a true turning point for me. I started examining 
how to apply my background to cancer research. 


Was it a difficult leap? 

No. Stanford has a flexible structure that can 
accommodate such transitions. I could have 
an adviser from anywhere in the university; I 
found an amazing one in Sanjiv Gambhir, who 
was working on molecular imaging. I knew 
that early detection of cancerous tissue could 
have prevented many cancer deaths, including 
my friend’s. Gambhir agreed to let me join his 
lab without a biology background. 


How did you persuade him? 

I think my motivation convinced him, but I 
had also read a lot about his imaging work. 
I was able to explain why I was so excited 
about the prospects for direct applications. I 
launched a project to convert light into sound 
so we could look into tissue and find out 
whether cells are cancerous. My programming 
skills helped me to create our first real imaging 
unit using lasers and ultrasound systems. 


Why did you pursue a postdoc in chemistry? 
As my PhD ended, I realized that much of what 
we do as biologists relies on chemistry. I had 
never taken a chemistry class, and wanted to 
study it. My dad has a PhD in chemistry, which 
also helped me to appreciate the subject. Luck- 
ily, Carolyn Bertozzi, a chemist at the University 
of California, Berkeley, accepted me as the first 
engineer in her lab. My knowledge of chemis- 
try was ata first-year-undergraduate level, but I 
learned a lot, often from undergraduates. 


Did you plan to do a one-year postdoc? 

No — [had a three-year fellowship. Some 
people want to stay in their field and crank 
out papers; I wanted to expand my toolkit 
with organic chemistry. After a year, I saw an 
opportunity to apply for a faculty position in 
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structural biology at Stanford. I was in no rush 
to leave the postdoc, but I felt ready. I got the 
position and started last autumn. 


How did you approach the interview? 

It is important to have an interesting story to tell 
when you apply. Given the tough job market, 
hiring committees are risk-averse. They don't 
want to hear, “I want to solve X problem.” They 
want to hear, “This is exactly how I'm going to 
solve X problem.” I feel fortunate that the search 
committee was willing to accept me, whereas 
others may have said I was at too early a stage. 


Last year, you won the Dale F. Frey Award for 
Breakthrough Scientists. How will it help you? 
It is awarded by the Damon Runyon Cancer 
Research Foundation in New York, which 
contributes 100% of the donations it receives 
to research that will affect patients. Through 
the foundation, I have met venture capitalists 
whom I am trying to bring to research brain- 
storming sessions. I want to work out how best 
to translate my group’s research and patents 
into commercial medical devices. I want the 
harsh feedback at the start of a project to help 
us to make sure that our technologies will help 
patients. We are willing to take big risks, but we 
want to make sure that there are big benefits. 


Should engineers explore life sciences? 

Yes. In the past year I have been teaching a class 
that exposes engineering students to biology; 
300 students have taken it. I show them how 
engineering tools can solve problems in biol- 
ogy and medicine, especially in cancer and 
cardiovascular disease. Several engineering 
students have joined immunology and cardi- 
ology labs. That makes me extremely proud. m 


INTERVIEW BY VIRGINIA GEWIN 
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Ua SCIENCE FICTION 


SURVIVORS AND SAVIOURS 


BY PHILIP T. STARKS 


he airlocks close behind me, but 
I can’t hear them. My ears regis- 

ter nothing but the radio and my 
breathing. White noise, the rhythmic 
acceptance and expulsion of recycled 
air, and my own inner dialogue keep 
me company as I take my first step. 
The ground compresses beneath 
my foot, but my protective gear 
mutes the full experience. 

I was selected for this mis- 
sion not because I’m the oldest 
or youngest, weakest or strong- 
est, smartest or dimmest, but 
because I’m expendable. When 
you're a dead-end vehicle in an 
evolutionary war, your job is to 
die first. I've been awarded this step 
because I’m sterile. 

Sterile not only describes me, but 
also the environment that I’ve survived 
in. I won't say, ‘lived in, although 
the others seem hopeful enough 
to equate survival with living. 

I don't. Up until now, I’ve been 
surviving. However, with each 
step taking me farther from the 
enclave into the real world, I feel more alive. 

AndI'm notalone. Out here, the world is 
teeming with life. Some teams collaborate, 
some hold ground in extended battle and 
others triumph. The skeletal remains splin- 
tering beneath my feet, serving as coral-like 
scaffolding for microbial masters, remind me 
that when one team triumphs, another fails. 
I might be advancing, but my team is losing. 

Many teams have lost, but I suppose the 
majority have survived. It’s just my own bias 
that elevates birds over biofilm. But I find no 
comfort in these thoughts. So, while plough- 
ing through this microbial minefield, I focus 
on stories of long extinct beasts: of mammals, 
birds, reptiles and insects. I recite parables 
explaining how the loss of honeybees was an 
ignored bioindicator of a New Age, and how 
financial and political interests handcuffed 
a timely response. I fill my mind with these 
details to shield me from my fears. 

My mission is to find survivors, but the 
unspoken hope is to find females. That 
hope isn’t buoyed by data or experience. Of 
almost 10,000 embryos, only 1,500 survived. 
Even this pathetic number stressed our pool 
of viable uteruses, forcing us to transfer as 
many as three to the younger soldiers. Most 
failed to implant, or perished shortly there- 
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after. Our hopes were pinned on 350 fetuses, 
280 of which survived birth; 165 enjoyed 
their ninth birthday, and prior to their tenth, 
we released them. Only 68 were female. 

Now I’m searching for at least one. I 
understood that this trip would be long, and 
that the first steps would reveal our past fail- 
ures. Despite these warnings, I didn't antici- 
pate the boneyard I'd trample. The beauty 
of the growth encompassing them notwith- 
standing, these were my kin. Still, the horror 
of the eviscerated, cleanly consumed bod- 
ies was infinitely preferable to what I feared 
came next. 

I pass through the final gateway, fright- 
ened but hopeful, but ultimately unprepared. 
My fears, developed and nourished in iso- 
lation, paled against this sight. Extinction 
nestles into one’s arms more easily than a 
dead child. And I can't possibly hold all of 
the dead children. 

Most of the food and water packets litter- 
ing the ground are untouched. I am pleased, 
actually, by the advanced decomposition: 

most died immedi- 
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long enough to rejoin us. I mourn them, 
mutter a silent prayer, and wish they were 
never born. 

I begin tallying the dead, then stop and 

radio in: “I count at least 150, ’m coming 
back? I turn down the volume and don’t 
wait for an answer. We fertilized the 
eggs, developed the embryos and 
reared the young. We took the sur- 
vivors and released them, unpro- 
tected, into this world. We killed 
them: I just couldn't count every 
last dead child. 

If had, my count would have 
fallen short. My walk back to my 
sterile tomb was not solitary, and 
before I reached the airlock I felt 

a delicate hand tug my gloved 
finger. 
“Dont leave me.” 
She is beautiful: eleven years old, 
fit, and without protective gear. Her 
words are stronger than she knows: 
with them, she has sealed both our 
fates. lam overcome with joy and 
grief: 1am thrilled she breathes, but 

I wish I was dead. 

“I wont leave you,” I promised 
her. “Where are your friends?” 

“Gone;’ she said. I understood. She was 
the lone survivor of our challenge test. She 
has endured our abuse, and now I'll deliver 
her back to her abusers. 

“So many people are looking forward to 
meeting you!” I confess to her the truth, but 
not its depth. I know that delivering her to 
the airlock will kill her freedom. She has 
survived, but only to be an egg donor. The 
goal of our research has always been to pro- 
duce a generation that could thrive without 
protection. If she carries alleles that protect 
her against extinction-level pathogens, we 
need them. 

I bring her to the airlock, call in the news 
and lie to her. “I'm going to step aside and 
protect the site, so I know you make it in” 
As she enters our crypt, I remove my mask, 
remember those we saved and mourn those 
we killed. As a dead-end vehicle, lam happy 
to reach my end. I breathe deeply, taste the 
Age of Microbes, and convince myself that 
the Age of Man is not over. m 
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